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[57] ABSTRACT 

A method and apparatus is disclosed for digital multipli- 
cation based on sums and differences of finite sets of 
powers of two. It is observed that for a given multipli- 
cand signal A, multiples of the form A-P/Q can be 
defined by adding or subtracting term signals when 
each term signal (T|,Tj) is selectively made representa- 
tive either of the multiplicand (A) multiplied by a 
power of two (T=A*2-0 or representative of a nuUity 
(T=0*A). A mapping unit is provided for controlling 
responsive barrel shifters and for controlling one or 
more responsive adder/subtractor units so that the re- 
sultant system has a transfer function equivalent to that 
of an n-bits by m-bits multiplier. 

38 Claims, 7 Drawing Sheets 



UOa 



90 



2^ 



« 1 



n+0 



109 



n 



» 1/2 
108 J^lOl 



n+1 



•» 1/4 



n+2 



103 



« 1/8 

IT 



n+3 
-7^ 



n - BY - 3 BIT MULTIPLIER 



100 



A « 2"* OR 0 



/^120 



-UOo 
115 
125 



n+3 

4- 



A « 2 J OR 0 



I30a^ 



y-"130 



— 7* 
n+3 



1200 



> E 

/- 



n+3 



130o 



SIGN SELECT 
130x 



^I20x 



100^ 



^401 



150x 



7 

150' 



TABLE - IB 
MAPPER 
(DECODER) 



I 
J 
K 



09/30/2004, EAST version: 1,4.1 



U.S. Patent 



Mar. 28, 1995 



Sheet 1 of 7 



5,402,369 




09/30/2004, EAST version: 1.4.1 




09/30/2004, EAST version: 1.4.1 



U.S. Patent 



Mar. 28, 1995 



Sheet 3 of 7 



5,402,369 



Of 
U 



-J 

H 
1— • 

n 
u 



I 

c 



o 
o 

CO 




CO 
o 



in 

CO 



□ 



cu 

3K 




a 

1 

cu 

< 
11 



o 

CO 



o 
cu 

CO 



in 
cu 

CO 



/ 



X 
o 
cu 

CO 



o 

CO 



en 
cu 



c5 
o 

CO 



o 
CO 

c 



cu 



\ 



c 



d J2 

CU cu 

o o 

CO CO 



CO 



Nc 

CO 
o 
CO 

/ 



\ 



o 
CO 



U "g \ 



o 

CO 



00 



\ 



CO 

6 

h-H 




X 
o 
in 

CO 



09/30/2004, EAST version: 1.4.1 



U.S. Patent 



Mar. 28, 1995 



Sheet 4 of 7 



5,402,369 





ID 

- ^ r. 

8: 5 S 

<C ^ ^ 



O 



Hi' 



X 

o 
in 



09/30/2004, EAST Version: 1.4.1 



U.S. Patent Mar. 28, 1995 sheet 5 of 7 5,402,369 



Dcvla-fclon From True 8 Bli Accuracy 



o 
o 



CO 
o 
o 



o 
o 



o 
o 



o 
o 



o 
o 
o 
I 



CO 
o 
o 

o 
I 



o 
o 

o 
I 




GO 
o 



o 



%0 
c5 



m 
o 



o 



CO 
o 



CXJ 
o 



T-l O 



Transfer Gain Function C/A 



09/30/2004, EAST Version: 1.4.1 



U.S. Patent 



Mar. 28, 1995 Sheet 6 of 7 5,402,369 




(li 



+ 

X 

+ 

c 



+ 
CO 

+ 

c 



CO 










cu 










c 




cn 


1 




ai 


X 






lO 


c 


1 



CO 



• • • 



• • • 



+ 



o 

in 



cu 
m 



CO 


1 




n 


X 






to 


c 


1 


00 




+ N 




c 





rt 



ID 



a. 

CL 

<: 



I 

o 
lO 

in 
I 



OQ 
1 




CU 




K 



00 
o 

in 




o 
in 




09/30/2004, EAST Version: 1.4.1 



U.S. Patent 



Mar. 28, 1995 



Sheet 7 of 7 



5,402,369 




09/30/2004, EAST Version: 1.4.1 



5,402,; 



METHOD AND APPARATUS FOR DIGITAL 
MU LTIPUCATION BASED ON SUMS AND 
DIFFERENCES OF FINITE SETS OF POWERS OF 

TWO 5 

BACKGROUND 

1. Field of the Invention 

The invention relates generally to digital circuitry. 
The invention relates more specifically to an improved 
method and apparatus for digital multiplication based 
on shifting of binary-coded data. 

2. Description of the Related Art 

Use of multiplication is widespread in digital elec- 
tronic circuits. 

The basic AND gate inherently performs a 1-bit by 
1-bit, binary-coded multiplication. Larger sized multi- 
plications in which an n-bit wide multiplicand signal (A) 
is multiplied by an m-bit wide multiplier signal (B) to 
produce a binary-coded result signal (C=A*B) that is 
k-bits wide, where the number of result bits k can be 
substantially greater than 1, are also ubiquitous. (In 
most cases, k^m+n. Sometimes the result can be ex- 
pressed with less than m+n bits through the use of 
truncation with or without round-off error correction.) 25 

Bxamples of reliance on digital multiplication may be 
found in fields such as digital signal processing (DSF) 
and neural networks. Different proportionality 
"weights" are routinely assigned in these fields to vari- 
ous parameter signals in order to provide a desired DSP 30 
filter function or neural net pattern. Parameter 
weighting is typically performed by digitally multiply- 
ing a parameter signal by a corresponding weight coef- 
ficient signal. In the case of real-time adaptive filtering 
operations and the like, input signals arrive at real-time 35 
speed (which can be quite fast). Weighting coefficients 
may have to change at real-time speed in response to 
rapidly changing input or other conditions. As a conse- 
quence, parameter weighting operations may also need 
to complete in relatively short time. 40 

Digital multiplication is, of course, also found in the 
field of general purpose computers. Phrases such as 
"scaling by a factor**, "modulating with another signal" 
and "attenuating by a factor" are often used in the field 
of signal processing as equivalents for the operation of 45 
generating a result signal that is coded to represent the 
multiplication of values represented by two or more 
input signals. 

Quite often, one or more of the multiplicand (A), the 
multiplier (B), and the result signal (C=A-B) represent 50 
a physical quantity such as, but not limited to: (a) X-ray 
or other tomography measurement signals that are 
being digitally processed as they are gathered in real 
time, or after collection, for purposes of improving 
image quaUty; (b) heartbeat or other medically-related 55 
measurements tiiat are being collected in real time and 
digitally processed for purposes of providing immediate 
diagnosis and treatment; (c) audio or ultrasonic signals 
that are being pre-processed in real-time prior to their 
production as physical sound waves or that are being 60 
post-processed after reception of physical counterparts; 
(d) video signals that are being pre-processed in real- 
time prior to production as physical light images or 
post-processed after reception of physical counterparts; 
and (e) digital telecommunication signals, such as used 65 
in modems and facsimile machines, where the digital 
telecommunication signals pass through a digital multi- 
plication process as part of a pre-emphasis (modulation) 



2 

operation or post-emphasis (de-modulation) operation 
or as part of an error-detection and/or error-correction 
operation or as part of a data compression or decom- 
pression operation. 

In many instances, several multiplication operations 
need to be carried out in a mass-produced circuit. Pref- 
erably, such a circuit should be of low cost, compact 
and composed of one or as small a number of integrated 
circuit (IC) chips as is economically and technologi- 
cally practical. It is often desirable to squeeze one or 
more multiplier circuits onto a single monolithic inte- 
grated circuit (IC) chip in a manner which minimizes 
circuit size, complexity and cost, and gives each IC chip 
a relatively high level of functionality and performance. 
Smaller circuit size and simpler circuit topology usuaDy 
go hand in hand with fewer mass-production defects, 
increased reliability, reduced power consumption, 
faster performance, and reduced costs. 

Ideally, each on-chip multiplier circuit should be of 
minimal size so that it can be squeezed economically 
into the linoited space of an IC chip together with like 
and other functional circuits of the IC. Bach on-chip 
multiplier circuit should also be very fast. It should 
complete its multiplication operations in Tninimfll time. 
Moreover, each multiplier circuit should be relatively 
accurate; meaning that it can produce a correct, or 
approximately correct, result signal (C=A'B) even 
when given an n-bit wide multipUcand signal (A) and 
m-bit wide multiplier signal (B), where n and m are 
relatively large numbers. 

Unfortunately, conventional approaches to multiplier 
design fail to attain ideal combinations of these charac- 
teristics without making substantial compromises. Mul- 
tiplication speed and the nxmiber of bits handled can be 
increased through the use of conventional parallel de- 
sign (e.g., a Wallace tree multiplier), but this tends to 
increase circuit size dramatically. Large circuit size is 
undesirable because it leads to decreased mass produc- 
tion yields, reduced per unit reliability, increased per- 
unit costs and higher levels of power consumption. 
Circuit size can be minimized through the use of a 
highly serial design, but then multiplication speed is 
disadvantageously reduced. 

SUMMARY OF THE INVENTION 

The invention overcomes the above-mentioned prob- 
lems by providing an improved method and apparatus 
for digital multipUcation based on sums and differences 
of finite sets of powers of two. 

It is observed that the values 0 through 8 can be each 
expressed as a sum or difference of two terms, where 
each of the terms is set equal to a power of two or to 
zero. 

. Multiplication of an n-bits wide, multiplicand signal 
(A) by a multiplier signal (B) is realized in accordance 
with the invention by a multiples generating system 
comprised of: (a) a plurality of barrel shifting units for 
providing bit significance translation; (b) a summing 
unit operatively coupled to the barrel shifting units; and 
(c) a mapping control unit operatively coupled to the 
barrel shifting units and the summing unit for control- 
ling the barrel shifting units and the summing unit. The 
multiplicand signal (A) is applied in parallel to the plu- 
ral barrel shifting units. Each barrel shifting unit re- 
sponds to a shift control signal (IJT) suppHed to that 
shifting unit from the mapping control unit and outputs 
a corresponding term signal (T) representing either a 
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zero or the multiplicand signal (A) shifted by a number speed, physical power requirements and of physical 

of bits defined by the shift control signal (I, J). The map interconnect requirements. 

control unit responds to a supplied, m-bits wide, multi- Consider first, a given multiplicand, A, that is to be 

plier signal (B). Resulting term signals (T,-, Tj) are ap- multipHed by a multiplier value of B =2 to produce the 

plied to the summing unit (adder/subtractor unit) for 5 result, C=A-B=A'2. The bullet symbol"." is used here 

producing a result signal (C) representing sums and/or to denote pure mathematical multiplication, 

differences of the term signals. The mapping control In distinction, an asterisk followed by a power of 

unit determines whether the summing unit will perform two, as in the notation "A*2'**', is generally used below 

an addition or a subtraction for each term signal Ap- to denote the physical act of shifting a representative 

propriate mapping logic within the mapping control 10 binary-coded signal such as A by i bit positions to the 

unit causes the result signal (C) to be a binary-coded left, if i is positive; or i bit positions to the right, if i is 

signal representing a product of the multiplicand signal negative to obtain a corresponding result signal repre- 

(A) and the multipHer signal (B). senting multiplication by the stated power of two. Trail- 

BRIEF DESCRIPTION OF THE DRAWINGS positions are padded with zeroes. Leading bit 

15 positions are padded with zeroes or ones depending on 

The below detaOed description makes reference to the sign of the shifted signal A. (The latter operation is 

the accompanying drawings, in which: referred to as sign extension.) 

FIG. 1 is a schematic of an n-by-3 bit multiplier mod- The expression A*2-^ therefore indicates that a rep- 

ule (or "graduator" module) in accordance with the resentative signal. A, is physicaUy shifted by i bits to the 

invention; 20 right to produce a result signal representative of the 

FIG. 2 is a detailed schematic showing a pass transis- mathematical expression A.2-', (An exception to this 

tor implementation for the barrel shifters and adder/- usage is the expression A*0, which is used here to mean: 

subtracter unit of FIG. 1; replace tiie A signal with a zero result signal) 

FIG, 3 is a schematic of an n-by-4 bit pseudo multi- The arrow symbol is used below to indicate 

plier module (or "graduator" module) in accordance 25 transition from the purely matiiematical domain to the 

with the invention; domain of physical implementation. 

FIG. 4 is a block diagram of an n-by-7 bit true multi- Given this, it is observed that the mathematical oper- 

plier module or. alternatively, an n-by-8 bit pseudo ation C=A.B=A.2 can be physicaUy implemented in 

multiplier module in accordance with the invention; many ways, including but not Hmited to one or more of 

FIG. 4B is a plot showing the gain and error of the 30 the following operations: 



side Physical implementation 

^•2 = ^.(4-2) A*22-A*2^ (Op!) 

^.2«^.(2-0) ^ A*2^ (Op2) 

A'2^A^(\ + \) A^-hA1° (Op3) 



*^"'^*(*+T+T + T + -^+-^ + -") ^-(20 + 2-1 + 2-2 + 2-3 + 2-V 



. ) (Op4) 



n-by-8 bit pseudo multiplier module of FIG. 4A for 
values of B equal to 0 through 255; 

FIG. 5 is a block diagram of an n-by-m bit true multi- 45 
plier system or, alternatively, an n-byH(m-}- 1) bit pseudo 
multiplier system composed of plural graduator mod- 
ules in accordance with the invention; and 

FIG. 6 is a block diagram of another n-by-m bit true 
multiplier system or, alternatively, an n-by-(m-|-l) bit 50 
pseudo multiplier system composed of plural graduator 
modules in accordance with the invention. 

DETAILED DESCRIFTION 

The invention takes advantage of a set of mathemati- 55 
cal truths to provide a multiplier circuit (or a signal 
modulating circuit) that can be made smaller and/or 
faster and/or less expensive than conventional multi- 
pher/modulator circuits. 

The workings of the invention are perhaps best ex- 60 
plained by first going through a set of mathematical 
exercises. 

It is to be understood that although the exercises are 
initially mathematical in nature, each exercise has physi- 
cal consequences when carried out by corresponding 65 
physical circuit elements; as will be explained in more 
detail below. The physical consequences include the 
establishment of physical circuit size, of physical circuit 



The physical implementation side (right side) of oper- 
ation Op2 is the conventional approach. One simply 
shifts the multiplicand signal A left by one bit position, 
sets the resulting least significant bit to zero CO"), and 
does nothing else. Such shifting can be carried out seri- 
ally or in parallel. Parallel shifting can be implemented 
by hardwired routing of bus wires from a source regis- 
ter to a result register, and by hardwiring trailing bit 
positions to have zero values, thereby providing a very 
fast and economical solution. 

The physical implementation sides of above opera- 
tions Opl and Op2 could be alternatively used, but they 
have the disadvantage of consuming more resources. 
Two shifters and a subtractor are used by Opl. Two 
shifters and an adder are used by Op3. It is worthy to 
note nonetheless, that a desired result for the computa- 
tion, A*2, can be obtained as a sum or difference of 
signals provided by two shift operations. 

The mathematical side Of Op4 provides a precisely 
correct answer only when carried out to infinity. Thus 
it is not possible to carry out an exactly equivalent oper- 
ation on the physical implementation side. It is worthy 
to note, however, that one can quantize the distance 
between a starting point (the number 1) and an end 
point (the number 2) to any level of finite precision by 
considering ever more negative powers of two. The 
precision of the result on the physical implementation 
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side increases as one adds more stages and the result 
error decreases. 

A more generic representation of Op4 is the follow- 
ing Op5: 



(Op5) 



B can be any value in the range 0 to 2. With appropriate 
selection of a plus or minus for each "±" operation, one 
can converge on any value of B in the range 0 to 2. 
Certain values (rational values) can be precisely 
reached with a finite number of ± operations while 
others (irrational values) require an infinite number of 
such operations. Even for vaJues that require an infinite 
number of ± operations, one can provide a usefully 
close approximation with a finite number of such opera- 
tions. It is to be noted as an aside, that for certain appli- 
cations, such as found in many DSP or neiiral network 
or "fuzzy logic" systems, a certain level of imprecision 
in the less significant bits of a result may be tolerable. 

The above observations appear to have little rele- 
vance until coupled with the following. 

Consider a number line extending from x=0 to x= 1. 
Divide the line into eight equal segments, or "quaii- 
tums". The division points fall at x=0, 0.125, 0.25, 
0.375, 0.50, 0.675, 0.75, 0.875 and 1.00. The positional 
values of the segment end points can be expressed as 
fractions of the form P/Q, where Q=8, and P steps 
over the range of integers, 0 to 8. The same values can 
also be expressed in terms of sums or differences of 
integral powers of two as shown in the below Table-1 A. 

TABLE-IA 



Frac 
P/Q 


Dec = 


Ti = 
Two to 

PWT - i 
or zero 


Op = +/- 


Tj = Two to 
Pwr — j or zero 


0/8 


0,000 = 


2-3 




2-3 


1/8 


0.125 = 


■0- 


+ 


2-3 


2/8 


0.250 = 


2-1 




2-2 


3/8 


0.375 = 


2-1 




2-3 


4/8 


0.500 = 


2-1 




-0- 


5/8 


0.625 = 


2-1 


-H 


2-3 


6/8 


0.750 = 


2-0 




2-2 


7/8 


0.875 = 


2-0 




2-3 


8/8 


1.000 = 


2-0 




-0- 



10 



15 



20 



25 



30 



35 



40 



45 



of two value, 2-^. To get to the value 0.875, one starts 
at an integral-power of two value such as, 2-0, and one 
steps down by another integral-power of two value, 
2-3. 

The combination of sums or differences of powers of 
two and the constant, zero, that are expressed above in 
Table-1 A are not necessarily the only ways to reach the 
target values. The value P/Q =0.750 for example can be 
obtained by the summing operation 2- ^ 4-2-2 instead of 
the illustrated difference operation, 2-0— 2 -2. The 
value P/Q=0 can be obtained by adding zero plus zero 
or by subtracting any equal pair of values. Table-1 A is 
merely illustrative of the concept Values that are not 
themselves powers of two, can be constructed from 
sums and differences of powers of two. Preferably, one 
starts at a power of two that is closest to the target 
value, and then graduates up or down by a smaller value 
that is also a power of two in order to reach the target 
value. For all cases of P equals 0 to 8, the fraction P/8 
can be constructed as a sum or difference of just two 
terms, T/ and Ty, where each term is equal to a negative 
power of two (2-' or 2-J) or a zero value (—0—). 

Table- IB (bdow) shows how a physical implementa- 
tion can be constructed according to IF-THEN rules. 



The columns of Table-IA represent, in left to right 50 
order, the target value expressed as a fraction P/Q, the 
target value expressed as a decimal, an equal sign, a first 
term T, that is equal to either a first negative power of 
two (2-') or a zero value (—0-), an addition or sub- 
traction operation (-|-or — ), and a second term Tj that is 55 
equal to either a second negative power of two (2-/) or 
a zero value (—0-). Table-IA therefore shows how 
each fraction P/Q, for P— 0 to Q, can be formed of a 
sum or difference (-f /— ) of a first tenn T/and a second 
term signal Tj where each term is equal to a negative 60 
power of two (2-* or 2~f) or a zero value (—0—). 

Note that the values, 0.125, 0.25, 0.5 and 1.00 can be 
expressed in terms of a single integral-power of two 
(plus or minus a zero value), but the values 0.375, 0.625, 
0.75, and 0.875 are expressed as differences or sums of 65 
integral-powers of two. In other words, to get to the 
value 0.750, one starts at the mtegral-power of two 
value, 2-^ and one steps up by another integral-power 



TABLE-IB 


Then: 


Then 


Then: 


If -Pwri 


Op 


-Pwr j 


P = or zero 


+/- 


or zero 


0 — 


? 




1 — 


. + 


3 


2 1 




2 


3 1 




3 


4 1 


7 




3 1 




3 


6 0 




2 


7 0 




3 


8 0 


7 





The columns of above Table- IB represent, in left-to- 
right IF/THEN order, the target value expressed by 
the numerator portion (If P=) of the fraction P/8, a 
consequential fu^t negative power i for two (2-0 or a 
consequential zero value (expressed as " — "), a corre- 
sponding addition or a subtraction or a don*t care opera- 
tion (+ or — or ?), and a consequential second negative 
power j for two (2-J) or a zero value ( — ). Table- IB 
therefore shows how each fraction P/8, for P=0 to 8, 
can be constructed of a sum or difference (+/— ) of two 
selected terms, where each term is picked with repeti- 
tion allowed, from a group consisting of zero and the 
finite set of powers of two: 2°, 2-*, 2-2 and 2-3. 

FIG. 1 shows an apparatus 100 that is structured in 
accordance with the invention for carrymg out the 
IF/THEN operations specified in Table- IB to produce 
a desired result signal C=A-(P/8) for each integer 
value of P in the range 0 to 7, or 1 to 8; where P is 
mapped from a supplied multiplier signal, B. 

Input signal A can have a word width of any number, 
n, of parallel bits. The width number, n is preferably 
equal to or greater than 4 to enable nibble-wide multi- 
plication, and more preferably equal to an industry 
standardized data-word size such as 8-bits wide, 12-bits 
wide, 16-bits wide, 24-bits wide, 32-bits wide, 64-bits 
wide, and so forth. 

Input signal B is preferably restricted to 3 bits and the 
corresponding eight binary bit patterns of B equals 000 
through 111, are preferably mapped on a one-to-one 
linear basis to cover the cases of either P equal to zero 
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through seven or P equal to one through eight. When consecutive sequence of shift amounts (0, 1) or as a unit 
this is so, apparatus 100 produces a result signal C repre- for outputting a signal representing the zero value, 
sentative of either C=A-(B/8) for each integer value of In similar fashion, a second multiplexer 120 is pro- 
B in the range 0 to 7, or C=A-((B+ 1)/8) for each inte- vided having three input ports respectively denoted as 
ger value of B in the range 0 to 7. In such a case, appara- 5 120a, 12Xib and 120c, each having an effective width of 
tus 100 is accordingly referred to as an n-by-3 bit multi- n or more bits, a selection control port 120dc, and an 
plier module 100. If desired, B can be expanded to be 4 output port 120o which is n4-3 bits wide. A second 
or 5 bits wide so that it can cover a wider range of control signal, J, is applied to selection control port 
values for P. 120x to select one of mput ports 120fl, 1210b and 120c, as 
In the n-by-3 bit multiplier module 100 of FIG. 1, a 10 the mput port whose signal will be output on multi- 
first multiplexer 110 is provided having three input plexer output port I20o, 

ports respectively denoted as llOa, 110^ and 110c, each Input port 120c receives a signal representing the 
having an effective port width of n or more bits. First value zero (0). Input port 120fl receives an n-f 2 bits- 
multiplexer 110 is further provided with a selection widesignalrepresentingthe value of the multiplicand A 
control port llOx for selecting one of the three input 15 times a third power of two (2-2) which is one less than 
ports llOfl, 110^ and 110c. First multiplexer 110 is fur- the second power of two (2-i) used at input port 110c. 
ther provided with an output port llOo for outputting A symbol 102 representing a times one-quarter multi- 
the signal applied to the selected one of the three input pUer which could also be represented as *2-2) is 
ports llOfl, llQb and 110c. For reasons that will become shown interposed between the multiplicand A signal 
apparent shortly, output port llOo is shown to have a 20 supply bus 109 and input port 120a for carrying out 
bus width of n -1-3 bits. multiplication by this power of two. Input port 120^ 

A first 2 or 3-bits wide control signal, I, is applied to receives an n-|-3 bits- wide signal representing the value 

the selection control port llOx of first multiplexer 110 of the multiplicand A times a fourth power of two (2-3) 

for selecting one of input ports 110a, 110^ and 110c, as which is one less than the third power of two (2-2) used 

the input port whose signal will be output on multi- 25 at input port 120a. A symbol 103 representing a times 

plexer output port llOo. one-eighth multiplier (*|, which could also be repre- 

Input port 110a receives a signal representing the sented as *2-3) is shown interposed between the sup- 
value zero (0). This input port llOc can be thought of as plied multiplicand A signal and input port 1206 for 
being n-bits wide although it is just one wire carrying carrying out multiplication by this power of two. 
the same zero bit to many places. The effect of selecting 30 The combination of second multiplexer 120, shift 
a zero can be realized in some multiplexer designs by units 102 and 103, and the zero value supplying input 
de-selecting both of the other input ports, 1106 and (120c) defines a second "or-zero" barrel shifter 125 

which functions either as a barrel shifter for outputting 

Input port 1106 receives an n-bits wide signal repre- a signal representative of A shifted by one of a consecu- 
senting the value of the multiplicand A times a first 35 tive sequence of left shift amounts (2, 3) or as a unit for 
power of two. The illustrated case shows A times one outputting a signal representing the zero value. 
(A* I or A*20). In this particular case, the n-bits wide A With appropriate settings of control signal I, the first 
signal is simply fed directiy into input port 110a. A "or-zero" barrel shifter 115 can be controlled to output 
symbol 90 representing a times-one multiplier (*1, a fu^t term signal, T/, that is n-i-3 bits-wide and repre- 
which could also be represented as is shown inter- 40 sents one of the values: 0, A*2-0or A*2- 1. With appro- 
posed between the bus 109 that supplies the multipli- priate settings of control signal J, the second "or-zero" 
cand A signal and input port 1106 for illustrating a barrel shifter 125 can be controlled to output a second 
general relation with other power of two multipliers term signal, Tj, that is n-f 3 bits-wide and represents one 
101, 102 and 103 of FIG. 1. of the values: 0, A*2-2 or A*2-3, 

Input port 110c receives an n-hl bits-wide signal 45 The n-by-3 bit multiplier module 100 further includes 
representing the value of the multiplicand A times a an adder/subtractor unit 130 having first and second 
second power of two (2-^) which is one less than the input ports 130a and 1306 respectively coupled to re- 
first power of two (2°) used at input port 1106. A sym- ceive the first and second term signals, T/ and Ty, from 
bol 101 representing a times one-half multiplier (♦J, the output ports llOo and 120o of the first and second 
which could also be represented as *2->) is shown 50 multiplexers. Itiput port 130a receives the T/ signal for 
interposed between the multiplicand A signal supply summing T/into the result signal C that is produced by 
bus 109 and input port 1106 for carrying out this multi- output port IdOo of the adder/subtractor unit 130. Input 
plication by a power of two. port 1306 receives the Ty si^ for either summing Ty 

It is understood that, for binary-coded representa- into or subtracting Ty out of the result signal C. 

tionsofnumerical values, multiphcation of the multipli- 55 (C=T, ±Ty.) A sign-control signal, K is applied to a 

cand signal A by a power of two can be realized in FIG. sign select control port 130x of the adder/subtractor 

1 by a hardwired parallel shift with the more significant unit 130 to determine whether the addition or subtrac- 

bits in the n-bits wide A signal being padded with zeroes tion of second term signal Ty is performed. Output port 

in the case where A is positive. (A string of leading ones 130a therefore produces a result signal C equal to (0, 

would be used in the case where A is negative.) The 60 A*2-o or A*2-i) plus or minus (0, A*2-2 ©r A*2-3). 

symbol "♦2'*' is used in FIG. 1 to denote a shift opera- Output, port 130o is either n+3 or n+4 bits-wide, de- 

tion performed to effect multiplication by the indicated pending on the range of values selected for P (0 through 

power of two, i. 7, or 1 through 8). 

The combination of first multiplexer 110, shift units It is to be understood that even though the effective 

90 and 101, and the zero value supplying input (110a) is 65 bit widths of n-|- 1 through n-i-3 are shown for the sig- 

referred to here as a first "or-zero" barrel shifter 115 nal carrying paths between the n bits-wide, multiplicand 

because it functions either as a barrel shifter for output- A signal supply bus 109 and the nH-3 bits-wide, output 

ting a signal representative of A shifted by one of a port 130o, each such path does not necessarily have a 
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full complement of n+x physical wires (x denotes here tor module since its main architectural limitation is the 

1, 2 or 3). A physical wire is not required for a bit posi- formation of the result signal C from a sum or difference 

tion that is always stuck at logic low C'O") or logic high of two term signals (T/ and Tj) where each term signal 

("1"). The effective bit widths of n + 1 through n-h3 are represents either zero or A multiplied by a power of 

shown for the purpose of indicating the logical align- 5 two. Arbitrary mappings of B to P may be used, if de- 
ment of bits on parallel paths. For instance, there are sired, to provide an encrypting function. But this is 

only n+ 1 variable input bits at input port IdOa of the perhaps too premature a point in our discussion to raise 

addcr/subtractor unit 130. These align with the most these concepts. 

significant n + 1 bits of the n-f-3 bits output at output The combinatorial logic circuitry of mapping control 
port 130o. Similarly, there are only n+1 variable input 10 unit 150 can be designed to be of minimal size and/or 
bits at input port 1306 of the adder/subtractor unit 130. maximxim speed using well-known Kamough mapping 
These aUgn with the less significant n-f 1 bits of the techniques or the like. Alternatively, mapping control 
nH-3 bits output at output port 130o. unit 150 can be formed from an appropriately pro- 
Control si^ials I, J and K are produced by a mapping grammed ROM, PROM, EEPROM, or other suitable 
control unit 150 (a decoding unit) in response to a multi- 15 random access memory means. 

plier signal (B) supplied to a control input port 15Qx of In yet another alternative embodiment, a sequence of 

unit 150. Mapping control unit 150 conceptually per- settings for control signals I, J, K can be produced by a 

forms two successive mappings, first from B to P, and one or more sequential signal generators, and the value 

then from P to I, J and K. In practice, these two map- for B or P can be inferred by selecting a corresponding 

pings can merge into one. 20 one of the generated settings for control signals I, J, K. 

Mapping control unit 150 is preferably formed as a There is no time delay for going from B to I, J, K be- 
combinatorial logic circuit which has the 3-bits wide B cause I, J, K are generated immediately and P or B are 
multiplier signal as its input (150x) and control signals I, hypotheticaUy inferred from the generated values of I, 
J and K as its outputs. The combinatorial logic of map- J, EL This inferential embodiment is particularly useful 
ping control unit 150 performs its P to I, J, K mapping 25 in applications where the settiags for control signals I, J, 
in accordance with above Table- IB or an equivalent. K turn out to be periodic over time. An example is a 
The combinatorial logic of mapping control unit 150 case where the multiplicand A signal is to be used to 
additionally performs a liner mapping of B to P and modulate a carrier frequency signal (B) having a fre- 
thereby drives the result signal C equal to the product quency that is a subharmonic of the data sample rate of 
A-B/8, where B can represent any one of eight sequen- 30 the multiplicand A signal. (The sample rate of the A 
tial values selected from the series of integers zero to signal is equal to the carrier rate multiplied by an integer 
eight (0, 1, 2, . . , , 8). . greater than one.) Calculations of the settings for con- 
Actually, with some minor modifications to the cir- trol signals I, J, K do not necessarily need to be done in 
cuit of FIG. 1; the 3-bit wide codes for B can be mapped real-time at high speed. The sequence of settings for 
over to any eight P values for P selected from the series 35 control signals I, J, K over a single period can be gener- 
of integers, minus ten to plus ten (— 10, — 9, , . . , 0, 1, 2, ated by a slow but compact-sized logic circuit; stored 
, . . , 10), as will be explained shortly. The number of thereafter in a circulating shift register, and cycled 
values allowed for P is preferably limited to eight in through over and over again as carrier modulation later 
order to limit the niunber of B input bits to 3 bits and proceeds in real time. 

thereby minimize circuit size and complexity. 40 The combinatorial logic implementation for mapping 
If a larger number of values for P is desired, the con- control unit 150 is generally preferred over a ROM or 
trol input port 150bc of mapping control unit 150 can be like mapper. The signal propagation time of the combi- 
expanded to 4 bits wide to permit a selection of as many natorial logic implementation, from the B input port 
as sixteen values for P in the range — 10 to + 10. If both (150bc) to the I, J, K output ports, is generally shorter, 
negative and positive values of P are to be supported, an 45 and the circuit size of lie combinatorial logic imple- 
additional line should be added to output port 130o of mentation is generally smaller than that realized with 
the adder/subtractor unit for indicating polarity. the ROM approach. Note that for the inferential ap- 
If even a larger number of values for P is desired, the proach, there is no delay in going frpm B to I, J, K 
control input port 150* of mapping control unit 150 can because the latter signals are generated directly, with- 
be expanded to 5 bits wide to permit a selection of any 50 out translation from a suppHed B or P value, 
values for P in the range —10 to +10. The 5-bit wide One advantage of the n-by-3 bit multiplier module 
implementation of control input port 15Qx is less pre- 100 is that signal propagation delay from the multipli- 
ferred than the 3 or 4 bit implementations because it cand A input port to the result signal C output port is 
does not make optimum usage of all 5-bit wide patterns relatively small. The power of two shifting units 101, 
that could be applied into the control input port 150x of 55 102 and 103 (♦i, *}, * J) can be constructed as hardwired 
mapping control unit 150. parallel bit shifts that have insignificant delay. First and 
The eight or sixteen chosen values for P do not even second multiplexers, 110 and 120, can be constructed to 
have to be sequential. Any arbitrary set of values for P also have relatively small signal propagation delays, 
can be picked from the range — 10 to + 10. Given this, Adder/subtractor unit 130 can be made to have a rela- 
module 100 is more correctly defined as a signal "modu- 60 tively short delay when designed, for example, accord- 
lator", or a signal "scaler", or a signal "graduator" ing to a well known "carry look-ahead" approach. Al- 
rather than merely as an n-by-3 bit multipUer because tematively, an equally well-known "carry-ripple" ap- 
module 100 can be designed to output selected gradua- proach could be used. Although the carry-ripple design 
tions, C=A-P/8, of the input A si^ial, for a predefined is slower than the carry look-ahead design, the carry- 
subset of P's selected by a suppUed index signal (B) from 65 ripple design has the advantage of requiring less circuit 
the integer set — 10 to 4- 10; the graduations being in space to implement. In cases where propagation time is 
increments of A/8. Module 100 might be more properly not very critical, and small circuit size is more critical, 
referred to as a "sum/difference of two terms" gradua- the slower but smaller carry-ripple design is preferred 
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over the faster but larger carry look-ahead design. Gen- The source of pass transistor 211 is connected to the 

eral-purpose arithmetic logic units (ALU's) can also be vertical bus wire canTing the zero signal (ground), 

used to implement the selectable add/subtract function. Application of a logic high ("I ") to multiplexer control 

The choice of circuit design for implementing the se- line la turns pass transistor 211 on and thereby couples 

lectable add/subtract function will be generally die- 5 the zero signal level to input node 214a. Inverters 214 

tated by the overall goals of the circuit in which the and 215 couple this signal as the most significant bit 

n-by-3 multiplier module resides. (Ti(n-. i)) of term signal T/ thereby effecting part of an 

Equivalent gate counts or transistor counts are tradi- A*0 operation, 

tiondly used as benchmarks for determining circuit size desired, a latch feedback transistor 216 is optionally 

and/or circuit speed. (For the latter, one counts the 10 provided in the circuit for selectively feeding back the 

number of gates connected in series in a critical path.) output 215o of inverter 215 to the input node 214<z of 

However, layout and interconnect routing often times inverter 214, thereby forming a data storage latch for 

play equally important roles in determining circuit size holding the logic low ("0") or logic high ("1") level 

and/or circuit speed. One of the advantageoiis features output by first multiplexer cell 210(n— 1). This data 

of the n-by-3 bit multiplier module 100 (or "graduator" storage latch can be used for pipelined operation of the 

module 100) is that its physical components can be multiplier module. Vertical latch control line 217 runs 

arranged on an integrated circuit in a layout topology parallel to bus 201 and connects to the gate of pass 

that requires relatively little space for interconnect transistor 216 for controlling the turn-on and turn-off of 

This helps to reduce consumption of die area and to also that transistor 216. 

reduce signal propagation delay due to excessive inter- ^0 ,.The illustrated topology 200 also mcludes a carry-np- 

connect length pang adder 230 havmg an n-bits wide, A-mput port 

FIG. 2 is a detailed schematic showing a pass-transis- an- n-bits wide, B-input port (BIN); a 1-bit wide 

tor implementation for the "or-zero" barrel shifters and ?^i?P^t P^rt (CYi) and an n-bits wide C-output port 

adder/subtracter ofFIG.l that requires relatively Uttle „ Lme 231 comiects the ou^ut n^^^^ 

space for intercomiect. The topology of this pass-tran- ^fJ^' (^SB) of the A-mput 

sistor implementation is refereiced^ 200. ^Al^^^^K''^ the carry-npphng adder 230. 

A first vertical bus 201 extends through topology 200 . Although not shown, it is to be understood that the 

for carrying the n bits of the multiplied si^ (A) in ™tn^offintmdtiplexer ceD210(n-l)andmverters 

11 1 ^ " . J ; ^Zr 214,215 IS repeated a number of times, as one moves 

parallel. The mdi^dualhnesofbus2W 3, dow first ve'rtical bus 201 to thereby define the i^bi!^ 

Aa Ai, A2, . . . , A;.2, A„ 1. Firs veitic^ bus 201 also ^^^^ ^ multiplexer 210 corresponding to 

cames the logic zero m signal which m tto miple- firstmultiplexer llOof FIG. 1. Multiplexer control Les 

mentation IS represented by gromid. Bus 210 IS A^^^^ ^/l^ continue down vertic^y as indicated by 

° It . ^ . 202 to control the pass transistors of the remaining cells 

A first mdtiplexer ceU 210(n-l) is formed of three 35 n^^itipiexer 210. Latch control line 217, when in- 

N-chaniiel MOS field effect msto^ 211, 212 and ^i^^ed, also continues downwardly as indicated by 203 

^^^tl^'l^nf ^^""^ control hues la. lb and Ic run paraUel control the latch feedback transistors (216) of the 

to bus 201 and respectively ^miect to the gat^ of remaining latches (214/215). The comiections made 

transistors. 211, 212 and 213. The source terminals of fr^m the source termmals of the pass transistors 

transistors, 211, 212 and 213 connect by way of horn 40 (2H-213) in the remaining ceHs of multiplexer 210 to 

taUy rumung hues to selectable ones of the vertical hnes vertical bus 201 will vary as understood by those 

(Aa-A„.i or 0)m bus 201. The drain terminals of transis- ^]^^ in the art to provide the A*0, A*l and A*i 

tors, 211, 212 and 213 are tied together and connect to fimctions. The more significant bits of shifted right 

an mput node 214^ of a signal amplifying inverter 214. versions of the multipHcand signal (A) are padded with 

Logic highs C 1") are apphed to the multiplexer con- 45 zeroes in the case where A is positive. If A is negative, 

trol hnes la, lb or Ic on a mutually exclusive basis. the leading bits are padded with ones. 

Because first multiplexer ceU 210{n- 1) is used for de- Those skilled in the art of mtegrated circuit fabrica- 

fining the most significant bit (T|(„.i)) of the first term tion will note that this topology 200 is very compact, 

signal, Ti, the source of pass transistor 212 is connected The first vertical bus 201 can be formed in the first 

to the vertical bus wire carrying the most significant bit 50 metal layer (metal-1) of an integrated circuit while mul- 

(Art.i) of the n-bits wide multipUcand signal (A). Appli- tiplexer control lines la, lb and Ic can be formed in an 

cation of a logic high ("1") to multiplexer control line lb underlying first polysilicon layer (poly-1). Pass transis- 

turns pass transistor 212 on and thereby couples the tors 211-213 consume very Uttle space in the substrate 

most significant bit (A„-i) of the multiplicand signal (A) of the integrated circuit The drain-to-drain connections 

to input node 214fl. Inverters 214 and 215 are series 55 between the transistors 211-213 can be made in the 

coupled to amplify and transfer this signal as the most substrate. The source to bus 201 connections can be 

significant bit (Tx(„.i)) of first term signal T/, thereby made by vias rising from the substrate to the metal-1 

efiiecting part of an A* 1 operation. lines. 

The source of pass transistor 213 is connected to the A second multiplexer 220, corresponding to multi- 
vertical bus wire carrying the second most significant 60 plexer 120 of FIG. 1, is formed of cells such as 
bit (An.2) of the n-bits wide multiplicand signal (A). 220(n— 1) also shown in FIG. 2. Multiplexer cell 
Application of a logic high ("1") to multiplexer control 220(n— 1) is composed of pass transistors 221-223 and 
line Ic turns pass transistor 213 on and thereby couples configured similarly to the already-described first multi- 
the second most significant bit (A„,2) of the multiph- plexer cell 210(n-l). A different set of multiplexer 
cand signal (A) to input node 214fl. Inverters 214 and 65 control lines Ja, Jb and Jc connect to the gates of tran- 
215 couple this signal as the most significant bit (T/(„.i)) sistors 221-223 though. Control lines Ja. Jb and Jc run 
of term signal T/thereby effecting part of an A* J opera- parallel to bus 201. The sources of pass transistors 
tion. 221-223 connect to selectable ones of the wires (Ao 



09/30/2004, EAST version: 1.4.1 





0-4 


-3-1-2 


1 - 4 


-2 0-2 


2-4 


-10-1 


I - 2 


0 070 


7 — same? 


1 l?0 


2 - 1 


2 270 


4-2 


3 2+1 


4 - 1 


4 470 


8 - 4 


5 4 + 1 




6 4 + 2 


8-2 


7 8-1 




8 870 


16-8 


9 8 + 1 




10 8 + 2 




n 


8 + 2+1 16-4-1 



5,402,369 

13 14 

through A„-i and 0) in first vertical bus 201. The drain of the adder/subtractor unit 130 similar to that akeady 

terminals of transistors, 221-223 are tied together and provided at input 130*. Mapping control unit 150 would 

connect to an input node of inverter 224. Inverters 224 then need to be redesigned to control the plus/minus 

and 225 are series coupled to amplify and transfer either selection at such a modified input 130a, 

this signal (Tjin-i)) or its one's complement to the most 5 TABLE 1C 

significant bit line 232 of the B-input port (BIN) of the 

carry-rippling adder 230 by way of a l-of-2 multiplexer . Alternate Sum of Alternate 

formedbyN-chaaneltnmsistors234and235.Ifdesired, vl^ 'Tel ^T^' tL. ^.1' 
a latch feedback transistor 226 can be provided for 
selectively feeding back the output of inverter 225 to 10 
the input of inverter 224, thereby forming a data storage 
latch for holding the logic low ("0") or logic high ("1") 
level output by second multiplexer cell 220(n— 1). Ver- 
tical latch control line 217 connects to the gate of latch 
feedback transistor 226 for controllmg the turn-on and 15 
turn-off of that transistor 226. 

Transistor 234 selectively couples the output of in- 
verter 224 to hne 232. Transistor 235 selectively couples 
the output of inverter 225 to line 232. A sign definmg 
signal, K, drives the gate of transistor 234. The comple- 20 
ment of K is supplied to the gate of transistor 235 by 
way of inverter 236. Sign defining signal K also drives 
the carry-in input (CYi) of adder 230. When K is at 

logic high ("1"), a carry is introduced into adder 230 Any one of the values 0 through 10 can be expressed as 

and the complement of signal (Tj(n_i)) passes to the 25 a sum or difference of two terms where each term is a 

most significant bit line 232 of the B-input port of adder positive or negative integer equal to either an integral 

230. When K is at logic zero C*0'*)t no carry is intro- power of two or to zero. Of course, the power of two 

duced into adder 230 and the noncomplemented version multiples (A*2-' and A*2-f) output by each of multi- 

of signal (Tj(n- 1)) passes to the most significant bit line plexers 110 and 120 may have to be varied to accommo- 

232 of the B-input port of adder 230. 30 date desired sets of values for A-P/8. As already noted, 

Although not shown, it is to be understood that the the value P= 1 1 requires three such terms and therefore 

circuitry of second multiplexer cell 220(n— 1), inverters cannot be implemented simply with a two port adder/- 

224, 225 and multiplexer 234/235 is repeated a number subtracter imit such as 130. 

of times, as one moves down first vertical bus 201 to The negative range, P = — 1 through — 10, can be 
thereby define then-bits or wider ports of a multiplexer 35 implemented in a number of ways. One can use the 
220 corresponding to second multiplexer 120 of FIG. 1 range, P= -h 1 through +10, and selectively perform a 
and the add/subtract selector of FIG. 1. two's complement operation (not shown) at Uie C out- 
Multiplexer control lines Ja, Jb and Jc continue down put 130o of FIG, 1. Alternatively, one can modify input 
vertically as indicated by 202 to control the pass transis- port 130a of the adder/subtractor unit 130 to have a 
tors of the remaining cells in multiplexer 220. The K 40 selectable plus-or-minus capability (not shown), as al- 
and K-complement lines continue down verticaUy as ready suggested above. The latter approach may be 
indicated by 204 to control the pass transistors usefiil in cases where both positive and negative values 
(234/235) associated with the remaining cells. The con- of P are desired. An asymmetrical modulator design, for 
nections made from the source terminals of the pass example, might call for an output C== A-(P/8) where P 
transistors (221-223) in the remaining cells of multi- 45 is selected from the range of eight values, —5 to -i-2. It 
plexer 220 to first vertical bus 201 will vary as under- is well within the purview of those skilled in the art to 
stood by those skilled m the art to provide the A*0, A*i construct an ALU for such a purpose which selectively 
and A* J functions. The more significant bits of shifted generates either the sum or difference of a plurality of 
right versions of the multiplicand signal (A) are padded supplied terms signals, T/, Ty, etc., where each term 
with zeroes in the case where A is positive. Leading 50 signal can represent either a positive or negative multi- 
ones are used in the case where A is negative. pie of a given multiplicand A signal. 

It has akeady been hinted that, with some minor If embodiment 100 (FIG. 1) is further modified by 

modifications, the n-by-3 bit mxiltiplier module 100 or expanding the B input port 15Q;c of mapping control 

"graduator" module 100 of FIG. 1 can be used to drive unit 150 to four bits rather than the illustrated three bits, 

the result signal C such that C defines a binary-coded 55 control unit 150 can be configured to drive result signal 

value equal to the product A*P/8, where P can repre- C to any one of 16 values representing A-P/8 in the 

sent any one of eight values selected firom the consecu- range of integers P equals — 10 to -h 10. If mapping 

tive series of integers, minus ten to plus ten (— 10, —9, control input port I50x is further expanded to five bits, 

— 8, — 7, . . , , 0, 1, 2, , . . , 10), The below Table- IC control unit 150 can be configured to drive C to any one 

shows why. Note that the last row of Table- IC calls for 60 of as many as 21 values in the range of integers — 10 to 

a sum of three rather than two terms and that this last -f- 10. Negative outputs will, of course, require an extra 

result of P = 1 1 cannot be formed therefore as a sum or output line for the sign bit. 

difference of just two terms, T/±Ty. Note also that When mapping control unit 150 is configured to pro- 

below Table-lC presupposes an ability to handle term duce selected graduations, C=A*P/8, of the input A 

signals T/ and Ty where either can be positive or nega- 65 signal, for a predefined, sequential subset of P values 

tive. One of the modifications that may have to be made selected from the consecutive integer set — 10 to H- 10; 

to the implementation 100 shown in FIG. 1 is incorpo- the output signal C takes on the linear transfer form: 

rating a selectable plus-or-minus function for input 130fl C= A-(B-f F)/8. where B is a supplied index signal and 
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F is an offset constant. Mapping control unit 150 can adder/subtracter unit 330. These align with the most 
also be configiired to produce selected graduations, significant n+2 bits of the n-f 4 bits output at output 
C=A-P/8, of the input A signal, for a predefined, port 330c?. Similarly, there are only n+2 variable input 
NONconsecutive subset of P values selected from the bits at input port 330^ of the adder/subtractor unit 330. 
consecutive integer set — 10 to + 10. In such a case, the 5 These align with the less significant n-f 2 bits of the 
output signal C can take on a NONlinear transfer fimc- n-|-4 bits output at output port 330a, 
tion of the generalized form: C= A*f(B)/8, where B is a Mapping control unit 350 receives a 4-bits wide, mul- 
supplied index signal and f(B) is an arbitrarily chosen tiplier signal (B) on control input port 350x and con- 
conversion function for mapping each input value of B verts it to appropriate control signals L, M and N so 
to a predefined and corresponding value of P. Nonlin- 10 that result signal C represents the product A-(P/16) for 
ear transfer functions find use in fields such as telecom- all values of P between 0 and 15 (or 1 and 16) except the 
munications, where signals may require nonlinear em- values P=ll and P=13. In one embodiment of the 
phasis and/or de-emphasis respectively before and after n-by-4 bit pseudo multiplier module 300, an input of 
transmission throu^ a telecommunications channel. b=P=11 produces the approximating result 
Arbitrary transfer functions may also be used for en- 15 C=A*10/16 and an input of B=P=13 produces the 
cryption and decryption functions. approximating result C==A*12/16. It is to be under- 
Referring to FIG. 3, suppose one wished to perform stood that the topology 200 shown in FIG. 2 can be 
an n-by-4 bit multiplication, where a 4-bits wide multi- applied to FIG. 3 in a fashion ftimilar to that applied to 
plier signal (B) represents one of sixteen values in either FIG. 1. 

the integer series 0 to 15, or the integer series 1 to 16. 20 xhe descriptors "approximate" or "pseudo" multipli- 

The n-by-4 bit pseudo multiplier module 300 shown in cation were used above in the description of the n.by-4 

FIG. 3 could be used to "approximate" such a multipU- bit module 300 to indicate that module 300 does not 

cation operation. (The reason for the qualifier "approxi- accurately replicate all the results of the A times B/16 

mate" will be clarified shortly.) multipHcation table. To be precise, the results for 

Like reference symbols and numbers in the "300" 25 a.11/16 and A.13/16 require a sum or difference of 

senes are used for elements ofFIG. 3 which correspond three terms (each a power of two) rather than two 

to but are not necessarily the same as the elements rep- terms. This is shown in below Table-2. 
resented by similar symbols and reference numbers of 

the "100" series in FIG. 1. As such, a detailed descrip- TABLE-2 

tion of the elements found in FIG. 3 is omitted here. 30 Alternate Sum of Alternate 

In brief, multiplexer 310 has four input ports Target Sum of Sum of 2 3 Sum of 3 

310a-310rf for selecting one of respective signals A*0, ^ ^ "^"^ 
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A*l, A*i, and A*i as the signal to be output from 
output port 310o in response to a control signal L ap- 
plied to control port 310x. Multiplexer 320 similarly has 35 
four input ports 320a-320^ for selecting one of respec- 
tive signals A*4, A* J, A* 1/16, and A*0 as the signal to 
be output from output port 320t7 in response to a control 
signal M applied to control port 32Qbc. Shift units 301, 
302a, 3026, 303 and 304 are interposed between the n 40 
bits- wide bus 309 carrying the multiplicand A signal the 
corresponding buses of effective bit-widths, n+ 1, n-|-2, 
n+2, n+3, n+4, that carry respective signals A*i, 
A*i, A*i A*i, and A*l/16. The combination of first 
multiplexer 310, and its shift units 290, 301 and 302a, 45 
and the zero value supplying input (310a) defines a first - 
"or-zero" barrel shifter 315. The combination of second 
multiplexer 320, and its shift units 3026, 303 and 304, It should be apparent that all values shown in Table-2 
and the zero value supplying input (ZTXid) defines a are in terms of sixteenths. The operation for realizing a 
second "or-zero" barrel shifter 325. First and second 50 target multiplication ofA*12/16 can be implemented by 
term signals, Tl and Tm, are supplied to respective, A*(8/16+4/16) or by A*(16/l 6—4/16) as indicated in 
nH-4 bits-wide mput ports 330a and 3306 of adder/sub- the row having a ''target value" of 12. Although values 
tractor unit 330 for producing, in response XjQtA sign such as 7/16 can be formed of three terms, 
select signal N that is applied to control port 330x, a (4-1-2+ 1)/16. the expressions which use only the sum 
result signal C representing either the sum or difference 55 or difference of two terms are shown while the expres- 
between the first and second term signals, Tx, and* Tm- sions which rely on a sum or difference of more than 
Result signal Ccanben+4orn+5 bits-wide depending two terms are hidden when possible. Note that the third 
on the values that may be presented at the inputs' 330a from left column (Sum of 2 Terms) of Table-2 is ar- 
and 3306 of adder/subtractor unit 330. (Note that the ranged such that smaller adjustments of ±0, ±1, ±2 
term value T=A*4/16 can be output from each of 60 and ±4 are on the right side of the two-term expression 
multiplexers 310 and 320. The term value T=A*0/16 Tx,±TAf while generally larger, startmg values of 16, 8 
can also be output from each of multiplexers 310 and and 4 (and 0) are on the lejFt side of the two-term expres- 
320.) sion Ti,±T3/. The Tl and Tm outputs of respective 

Comments made with regard to the effective bit multiplexers 310 and 320 follow this strategy, 
widths of FIG. 1 apply equally to the effective bit 65 Two power of two terms can not be used alone to 
widths shown in FIG. 3. Although input port 330a is form the target multipliers, 11/16 and 13/16. One solu- 
shown to have an effective width of n+4 bits, there are tion would be to do a summation or difference of three 
only n+2 variable input bits at input port 330a of the terms. If this were to be done, the 8+2+ 1 and 8+4+ 1 
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10 



15 



options are preferred over the 16—4—1 and 16—4+1 
options because the former options do not require sign 
selection for the third term. The + 1 portion of summa- 
tions S-hZ-fl and 8+4+1 would require an additional 
multiplexer (not shown) for selectively outputting 
A* 1/16 or A*0 and an additional input port (not shown) 
on the adder/subtractor imit 330 for adding the output 
of the additional multiplexer (not shown) into the result 
signal C. The mapping control unit 350 would have to 
have a further control output (not shown) to support 
this additional multiplexer (not shown). The problem 
with such a solution, however, is that the size and com- 
plexity of the overall circuit would have to grow to 
support this additional capability. 

Another solution is to simply accept a certain degree 
of imprecision in the output of the n-by-4 bit pseudo 
multiplier module 300. This can be done in appUcations 
which can tolerate some degree of imprecision in the 
less significant bits. If one were to use the output for 
A* 10/1 6, for example, as an approximating substitute for 20 
A* 11/16, the error would be —A/16. If one were to 
further use the output for A- 12/16 as an approximating 
substitute for A-13/16, the error would also be —A/16. 
The output is correct for all other values of B = P m the 
range P equals 0 to 15. 

Note that the sign of the error can be varied to 
+ A/16 for either or both cases of B= 1 1 and B= 13 by 
using the other adjacent results of B= 12 as a substitute 
approximation for B = 1 1 and usmg B = 14 as a substitute 
approximation for B=13. The amount and polarity of 
error is determined by the mapping performed in map- 
ping unit 350. Certain DSP appUcations, particularly 
those where the multiplication result signal C is aver- 
aged over time, can tolerate a small error that fluctuates 
uniformly from positive to negative. This balanced 
fluctuation tends to produced a statistically self-cancel- 
ing error. 

Like module 100, the module 300 shown in FIG. 3 
can be modified to produce a more generalized set of 
selected graduations, C=A.P/16, of the input A signal, ^ 
for a predefined subset of P values selected from a suit- 
able mteger set (e.g., -21 to +21). See the below Ta- 
ble-3 and note that values P= 1 1, 13 (from Table-2), 91, 
21, 23, 25, 27 and 29 can be approximated with an error 
of one part per 16. The output signal C can approximate 
the Unear transfer form; C=A-(B+F)/16, where B is a 
supplied mdex signal and F is an offset constant, or the 
NONhnear transfer form: C= A-f(B)/16, where f(B) is a 
conversion function for converting each input value of 
the index signal B to a predefined and corresponding 
value of P. 

Suppose one wished to perform an n-by-5 bit multipli- 
cation. The two and three-term options shown by 
below Table-3 in combination with above Table-2 could 
be used. But this presents the disadvantageous result of 
larger circuit size and complex control as already men- 
tioned in regard to the n-by-4 bit pseudo multipher 
module 300. 



25 



30 



35 



45 



50 



55 



TABLE-3 



60 



Target Sum of 
Value = 2 Terms 



Sum of 2 
Terms 



Sum of 3 
Terms 



Sum of 3 
Terms 



17 16+1 

18 16 + 2 
19 

20 16 + 4 

21 

22 

23 



16-4 



16 + 2+1 

16 + 4+1 
16 + 4 + 2 
16 + 8-1 



65 



18 

TABLE-3-continued 



Target Sum of 
Value = 2 Terms 



Sum of 2 
Terms 



Sum of 3 
Terms 



Sum of 3 
Terms 



24 16 + 8 

25 

26 

27 

28 32 - 4 
29 

30 32 - 2 

31 32 - 1 

32 32 + 0 



32-8 



16 + 8 + 1 
16 + 8 + 2 



32-8+1 
32-4-2 
32-4-1 

32-4+1 



16 + 8-2 
32-8-1 



FIG. 4A shows an alternative approach. The n-by-3 
bit module of FIG. 1 is combined with the n-by-4 bit 
pseudo multiplier module of FIG. 3 to provide an n-by- 
7 bit true mxiltiplier module 400 which can also function 
as an n-by-8 bh pseudo multipher. 

The n-by-3 bit module (100') generates, without er- 
ror, "coarse" multiples of input signal A in graduations 
of A divided by eight as defined by: A-P/8 for P=0 to 
7 or P=l to 8. The n-by-4 bit module (300') generates 
"finer adjusting" multiples of input signal A in gradua- 
tions of A divided by two hundred fifty six, as defined 
by: A.P7256 for P'^0 to 15 or F = l to 16 (but with 
approximation errors at P'— 1 1 and P'= 13). 

The A-P7256 fine graduations are added to or sub- 
tracted from the A-P/8 coarse graduations in order to 
provide a full spectrum of values A-P"/128 for the 
range, P"=0 to 128, thereby defining a true (i.e. , nu- 
merically precise) 7-bit by n-bit multiplier. Alterna- 
tively, the A-P'/256 fine graduations are added to or 
subtracted from the A-P/S coarse graduations in order 
to provide an almost full spectrum of values A-P"'/256 
for the range, P"'=0 to 256. (A.P"y256=A-P/- 
8±A-P7256.) The latter defines the n-by-8 bit pseudo 
multipher which has a small error of plus or minus one 
least significant multipHcand bit at the following thirty- 
two points: F"= 1 1, 13, 19, 21, 43, 45, 51, 53, 75, 77, 83, 
85, 107, 109, 115, 117, 139, 141, 147, 149, 171, 173, 179, 
181, 203, 205, 211, 213, 235, 237, 243 and 245. 

In FIG. 4A, an n-bits wide, multiphcand signal (A) is 
suppUed m parallel mto the n-by-i multipher module 
400 over an (n-h l>bits wide vertical bus 409. (Read the 
notation "n-by-J" as n-by-true7 or n-by-pseudo8.) The 
extra bit of bus 409 is the zero value bit (ground line). A 
7 or 8-bits wide, multipher signal (B) is mtroduced into 
the n-by-i multiplier module 400 over an 8-bits wide 
control bus 450jc. In the case where B is less than 8-bits 
wide, the less significant bits (LSB's) of control bus 
450^ are set to the zero value (grounded) or alterna- 
tively, the superfluous logic within mapping unit 450 
which rehes on such LSB's is eliminated. 

The n-by-3 module 100' of FIG. 4A represents the 
upper part of FIG. 1, above section hne 401. The n-by-4 
module 300' represents the upper part of FIG. 3, above 
section line 403. 

Ri represents a first partial result signal output by the 
n-by-3 module 100'. This first partial result signal K\ is 
11+3 bits-wide. As in FIG. 1, control signals I, J, K are 
appUed to the n-by-3 module 100' so that the first partial 
resxilt signal Ri will represent gradations of input signal 
A divided by 8. Ri=A.P/8 for P equal to 0 through 7 
(or alternatively, 1 through 8). 

A *\/X6 prescaler symbol 404 is drawn before the 
n-by-4 module 300' of FIG. 4A to mdicate that, unlike 
FIG. 3, the n-by-4 module 300' outputs a second partial 
result signal R2 representative of gradations of A di- 
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vided by 256. (R2=A.py256 for P' in the range 0 
through 15 or alternatively, 1 through 16 with approxi- 
mations at P' = 1 1 and P' = 1 3.) The second partial result 
signal R2is n-i-8 bits-wide. Control signals L, M, N are 
applied to the n-by-4 module 300' for producing desired 5 
values of second partial result signal R2 in the range 
A.0/256 to A-15/256 (or A.1/256 to A.16/256). 

It is to be understood that the * 1/1 6 prescaling opera- 
tion of symbol 404 is preferably combmed with the ♦I 
through *1/16 operations shown in FIG. 3, so that, 10 
rather than performing the *1/16 operation of symbol 
404 first and foUowing it with subsequent *l through 
*1/16 operations within module 300', hardwired con- 
nections are made between bus 409 and the multiplexers 
310 and 320 of n-by-4 module 300' for effecting, in one 15 
step, the resultant shift operations of: A* 1/16 (supplied 
to input port 310* of FIG. 3), A* 1/32 (supplied to input 
port 310c), A*l/64 (suppUed to 310J), A*l/64 (suppUed 
to 320fl), A*l/128 (supplied to 320i>), and A*l/256 
(supplied to 320c). The symbol 404 is shown outside of 20 
module 300' in FIG. 4A to emphasize that the finest 
(nonzero) gradation of A that can be represented by 
second partial result signal R2 is one sixteenth the finest 
(nonzero) gradation of A that can be represented by 
first partial result signal Ri. 25 

First and second partial result signals, Ri (n+3 effec- 
tive bits- wide) and R2 (n-|-8 effective bits- wide), are 
supplied to respective input ports 430fl and 4306 of 
adder/subtractor unit 430 for producing, in response to 
a sign select signal S that is applied to control port 430x 30 
of unit 430, a result signal C representing either the sum 
or difference between the first and second partial result 
signals, Ri and R2, at output port 430a Result signal C 
is n -1-8 or fewer bits-wide depending on the degree of 
precision and accuracy desired. Adder/subtractor unit 35 
430 is preferably made from a carry-rippling adder so 
that it will be compact in size, A faster but larger, carry 
look-ahead design can, of course, be used as an alterna- 
tive in implementing the adder/subtractor unit 430. 

Comments made with regard to the effective bit 40 
widths of FIGS. 1 and 3 apply equally to the effective 
bit widths shown in FIG. 4A. Although input port 4306 
is shown to have an effective width of n-|-8 bits, there 
are only n-l-4 variable input bits at input port 4306 of 
the adder/subtractor unit 430. These align with the less 45 
significant n-f-4 bits of the n+i bits output at output 
port 430o. The n-f-3 variable input bits at input port 
430a of the adder/subtractor unit 430 align with the 
more significant nH-3 bits of the n-t-i bits output at 
output port 430o. 50 

A mapping control unit 450 is further provided in the 
n-by-i multiplier module 400 as shown. The mapping 
control unit 450 receives the 7 or 8-bits wide, multiplier 
signal (B) maps B to P and P', and converts the P and P' 
values (coarse approximation and fine adjustment val- 55 
ues) to appropriate control signals I, J, K, L, M, N and 
S so that result signal C represents the product 
A<B/128) for B representing values between 0 and 128 
(the LSB of an 8-bit B signal is held at 0); or so that 
result signal C represents the product A-(B/256) for B 60 
representing almost all values between 0 and 256; the 
exception occurring as mentioned before for values that 
require a sum of three terms in n-by-4 module 300' 
rather than a sum of just two terms. 

Mapping control unit 450 is preferably formed as a 65 
combinatorial logic circuit which has the 8-bits wide 
bus 450x as its input and control signals I, J, K, L, M, N 
and S as its outputs. Mapping control unit 450 operates 
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in accordance with principles set forth in the above 
Table- IB and Table-2 or equivalents of these tables. 
Preferably, one starts at a "coarsely-granulated" Ri 
value that is closest to a desired target value, and then 
graduates up or down by a smaller and more "finely- 
granulated" R2 value in order to reach the desired tar- 
get value. The combinatorial logic circuitry of mapping 
control unit 450 can be designed using weU-known 
Kamough mapping techniques or the like. Alterna- 
tively, mapping control unit 450 can be formed from an 
appropriately programmed ROM, PROM, EEPROM. 
or other suitable random access memory means. The 
signal propagation time of the combinatorial logic im- 
plementation, from the B input port 450x to the I, J, K, 
L, M, N and S output ports, is generally shorter though, 
and the circuit size of the combinatorial logic imple- 
mentation is generally smaller. Thus the combinatorial 
logic implementation is usually preferred. 

The below Table-4 shows a representative mapping 
function where P is an 8-bits wide binary-coded signal 
applied to control bus 450x and I, J, K, L, M, N and S 
are the correspondingly output control signals. The 
notations in Table-4 for I, J, L and M define the selected 
negative power i in the expression "A*2-'*' if given as 
a number, or selection of A*0 if given as a triple dash "- 
- - The values in Table-4 for sign selection signals K, 
N and S indicate addition if equal to zero ("0") and 
subtraction if equal to one ("1"). The symbol "x" indi- 
cates a don't care. 

TABLE-4 



Index 


Module 100' 


Module 300* 


Module 


P 


I 


J 




L 


M 




430 


0-.256 


0:1 


2:3 


K 


4:6 


6:8 


N 


S 


0 






X 






X 


0 


1 






X 




8 


0 


0 


2 






X 




7 


0 


0 


3 






X 




8 


I 


0 


4 






X 






X 


0 


5 






X 




8 


0 


0 


6 






X 




7 


1 


0 


7 






X 




8 


1 


0 


8 






X 






X 


0 


9 






X 




8 


0 


0 


10 






X 




7 


0 


0 


U 






X 




7 


0 


0 


12 






X 




6 


0 


0 


13 






X 




6 


0 


0 


14 






X 




7 


1 


0 


15 






X 




8 


1 


0 


16 






0 






X 




17 






0 




8 


1 




18 






0 




7 


1 




19 






0 




6 


0 




20 






0 




6 


0 




21 






0 




7 


0 




22 






0 




7 


0 




23 






0 




8 


0 




24 






0 






X 




25 






0 




8 


1 




26 






0 




7 


1 




27 






0 




8 


0 




28 






0 






X 




29 






0 




8 


I 




30 






0 




7 


0 




31 






0 




8 


0 




32 






0 






X 


0 


33 






0 




8 


0 


0 


34 






0 




7 


0 


0 


35 






0 


6 


8 


1 


0 


36 






0 


6 




X 


0 


37 






0 


6 


8 


0 


0 


38 






0 


5 


7 


1 


0 


39 






0 


5 


8 


1 


0 


40 






0 


5 




X 


0 


41 






0 


5 


8 


0 


0 



09/30/2004, EAST version: 1.4.1 



21 

TABLE-4-continued 



5,402,369 



22 

TABLE-4-continued 



Index 

P 
0:256 



42 
43 
44 
45 
46 
47 
48 
49 
50 
51 
52 
53 
54 
55 
56 
57 
58 
59 
60 
61 
62 
63 
64 
65 
66 
67 
6S 
69 
70 
71 
72 
73 
74 
75 
76 
77 
78 
79 
80 
81 
82 
83 
84 
85 
86 
87 
88 
89 
90 
91 
92 
93 
94 
95 
96 
97 
98 
99 
100 
101 
102 
103 
104 
105 
106 
107 
108 
109 

no 

111 
112 
113 
114 
115 
116 
117 

lis 

119 



Module lOff 



I 

0:1 



J 

2:3 



Module 300' 
L M 
4:6 6:8 ^ 



— 7 
6 8 
6 
6 
5 
5 
5 
5 
5 
5 
5 
5 
4 
4 
4 
4 
4 
5 
5 
5 
5 
5 
5 
5 
5 
6 
6 
6 



0 
0 
0 
0 
I 

8 1 

— X 
8 1 

1 
0 
0 
0 
0 

8 0 

— X 
8 1 

7 1 

8 0 

— z 
8 1 

7 0 

8 0 

— X 
8 0 

0 

8 1 

— X 
8 0 

7 1 

8 1 



8 1 

7 1 

8 0 



8 1 

7 0 

8 0 

— X 
8 0 

7 0 

8 1 

— X 
8 0 

7 1 

8 1 



Module 
430 
S 



10 



15 



20 



25 



30 



35 



40 



45 



50 



55 



60 



65 



Index 


Module 100' 


Module SOC 


Modu] 


P 


I 


J 




L 


M 




430 


0:256 


0:1 


2:3 


K 


4:6 


6:8 


N 


s 


120 


1 


— 


X 


5 


— 


X 


1 


121 


1 


— 


X 


5 


8 


1 


1 


122 


1 


— 


X 


5 


7 


1 


I 


123 


1 


— 


X 


6 


8 


0 


1 


124 


1 


— 


X 


6 


— 


X 


1 


125 


1 


— 


X 


6 


8 


1 


1 


126 


1 


— 


X 




7 


0 


1 


127 


1 




X 


— 


8 


0 


I 


128 


1 


— 


X 


— 


— 


X 


0 


129 


1 


— 


X 


— 


8 


0 


0 


130 


1 


— 


X 


_ 


7 


0 


0 


131 


1 


— 


X 


6 


8 


1 


0 


132 


I 


— 


X 


6 


— 


X 


0 


133 


1 


— 


X 


6 


8 


0 


0 


134 


1 


— 


X 


5 


7 


1 


0 


135 


1 


— 


X 


5 


8 


1 


0 


136 


1 


— 


X 


5 


— 


X 


0 


137 


1 


— 


X 


5 


8 


0 


0 


138 


1 


— 


X 


5 


7 


0 


0 


139 


1 


— 


X 


5 


7 


0 


0 


140 


1 




X 


5 


6 


0 


0 


141 


1 


— 


X 


5 


6 


0 


0 


142 


1 


— 


X 


4 


7 


1 


0 


143 


I 




X 


4 


8 


1 


0 


144 


1 


3 


0 


4 





X 


1 


145 


1 


3 


0 


4 


8 


1 


1 


146 


1 


3 


0 


4 


7 


1 


i 


147 


1 


3 


0 


5 


6 


0 


1 


148 


1 


3 


0 


5 


6 


0 


1 


149 


1 


3 


0 


5 


7 


0 


1 


150 


1 


3 


0 


5 


7 


0 


1 


151 


1 


3 


0 


5 


8 


0 


1 


152 


1 


3 


0 


5 





X 


1 


153 


1 


3 


0 


5 


8 


1 


1 


154 


1 


3 


0 


5 


7 


1 


1 


155 


1 


3 


0 


6 


8 


0 


1 


156 


1 


3 


0 


6 





X 


1 


157 


1 


3 


0 


6 


8 


1 


I 


158 


1 


3 


0 





7 


0 


1 


159 


1 


3 


0 





8 


0 


1 


160 


1 


3 


0 








X 


0 


161 


J 


3 


0 


— 


8 


0 


0 


162 


1 


3 


0 





7 


0 


0 


163 


1 


3 


0 


6 


8 


1 


0 


164 


1 


3 


0 


6 


_ 


X 


0 


165 


1 


3 


0 


6 


8 


0 


0 


166 


1 


3 


0 


5 


7 


I 


0 


167 


1 


3 


0 


5 


8 


I 


0 


168 


1 


3 


0 


5 


— 


X 


0 


169 


1 


3 


0 


5 


8 


0 


0 


170 


1 


3 


0 


5 


7 


0 


0 


171 


1 


3 


0 


5 


7 


0 


0 


172 


1 


3 


0 


5 


6 


0 


0 


173 


1 


3 


0 


5 


6 


0 


0 


174 


1 


3 


0 


4 


7 


I 


0 


175 


1 


3 


0 


4 


8 


1 


0 


176 


0 


2 


1 


4 





X 


1 


177 


0 


2 


1 


4 


8 


I 


1 


178 


0 


2 


1 


4 


7 


1 


1 


179 


0 


2 


1 


5 


6 


0 


1 


180 


0 


2 


1 


5 


6 


0 


1 


181 


0 


2 


1 


5 


7 


0 


1 


182 


0 


2 


1 


5 


7 


0 


1 


183 


0 


2 




5 


8 


0 


1 


184 


0 


2 




5 




X 




185 


0 


2 




5 


8 


I 




186 


0 


2 




5 


7 


1 




187 


0 


2 




6 


8 


0 




188 


0 


2 




6 




X 




189 


0 


2 




6 


8 


1 




190 


0 


2 






7 


0 




191 


0 


2 






8 


0 




192 


0 


2 








X 


0 


193 


0 


2 






8 


0 


0 


194 
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Note that above Table-4 is periodic and repeats in the 
less significant columns (L,M,N) every 32 rows. This 55 
redundancy can be used to minimize the circuitry of 
mapping unit 450, as is well understood in the art. 

FIG. 4B is a plot showing the gain and error for one 
implementation of the n-by-8 bit pseudo multipher mod- 
ule 400 for values of B equal to 0 through 255. Ideally, 60 
the gain curve C/A should be a straight line extending 
from the point B=0, C/A=0, to the point B=255, 
C/A= 1.00. But as seen, the C/A curve deviates from 
the ideal, in the aggregate, for roughly one eighth of its 
overall length, while remaining on the ideal path for 65 
roughly, in the aggregate, seven eighths of its overall 
length. The deviation is better seen by the plot of the 
error, E=C-(A-B). The maxiTnuTn error is plus or minus 



1/256, which is just under four tenths of a percent. The 
direction of the error spikes can be varied, as already 
explamed above, by picking appropriate substitutions 
for the cases of 1 1/16 and 13/16 as applied within the 
n-by-4 module 300'. The selected substitutions can cre- 
ate errors that are either all positive, all negative, coun- 
terposed positive and negative, or any other desired 
distribution at the points defined by P"'=ll, 13, 19, 21, 
43, 45. 51, 53, 75, 77, 83, 85, 107. 109, 115, 117, 139, 141, 
147, 149, 171, 173. 179, 181, 203, 205, 211, 213. 235, 237, 
243 and 245. 

In the illustrated case, the error E distributes uni- 
formly over B equals 0 to 255, and symmetrically rela- 
tive to the zero error line. As such, this implementation 
is well suited for DSP applications that can tolerate a 
small, uniformly distributed error function. One such 
application is xise of the n-by-pseudo8 multipher module 
400 for converting YCC formatted video signals into 
composite video signals. The format conversion re- 
quires quadrature modulation of the color baseband 
signals to form a corresponding chromiance signal using 
a process that relies on a set of multiplications. Both the 
CRT tube and the human eye tend to integrate out small 
errors in chromiance over space and time. As such, the 
small deviations between what is displayed and what 
would have been displayed if the chromiance had been 
generated by a true n-by-8 multipher instead of by the 
n-by-i multiplier module 400, is tolerable. 

It is recognized in the art of modulator design that 
one obtains approximately 6 db (decibels) for each addi- 
tional bit added to the multiplier (B) signal. Under this 
sense, the least significant multipher bit of the n-by- 
pseudo8 multiplier module 400 can be said to add an 
addition of approximately 5 db (which is roughly iths 
of 6 db) to the resolution or gain of the system. This 
additional 5 db is usable in apphcations where the ac- 
companying error can be tolerated as is, or statistically 
averaged out. 

Referring momentarily back to FIG. 4A, it should be 
noted that the compact layout topology 200 of FIG. 2 
can be appUed to FIG. 4A. Bus 409 runs m parallel next 
to the multiplexers 110, 120, 310, 320 of n-by-3 module 
100' and n-by-4 module 300'. Control hues I, J, K. L, M, 
N and S also run in parallel through the various multi- 
plexers of elements 100', 300' and 430 for providmg 
appropriate shift-amount selection and sign-selection 
functions. 

The signal propagation delay of the n-by-i multipher 
module 400 is relatively small. Adder/subtractor unit 
430 can begin to generate the less significant bit of the 
result signal C as soon as the less significant bits (LSB*s) 
of the first and second partial result signals, Ri and Kt, 
become vahd. The general-purpose propagation time 
for performing an n-by-i multiphcation (assuming that 
new values are supplied at the same time for both the 
muldphcand A signal and the multipher B signal) is 
roughly equal to the sum of time delays through map- 
ping unit 450, plus the time for generating the less signif- 
icant result bits in the n-by-3 module 100' and in the 
n-by-4 module 300', plus the tune for propagating the 
rest of the result out Irom the n-by-3 module 100' and in 
the n-by-4 module 300'. This general-purpose propaga- 
tion time is reduced somewhat due to partial concur- 
rence of delays through mapping unit 450 and delays 
through n-by-3 module 100' and n-by-4 module 300'. 

FIG. 5 is a block diagram of an n-by-m bit true multi- 
pher system 500 or, alternatively, an n-by-(m+l) bit 
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pseudo multiplier system or, alternatively, an n-by- 
m/(m-|- 1) "graduator" system in accordance with the 
invention. This structure is formed by generalizing the 
structure shown in FIG. 4A. Coarse approximations are 
corrected with ever finer adjiistments to realize a de- 
sired level of accuracy and/or precision. 

An n-lines wide bus 509 carries the n bits of the multi- 
plicand signal (A). A series of n-by-3 modules (nx3) 
511, 512, 513, . . , , 51/ are operatively coupled to this 
multiplicand A signal supply bus 509. Each of the n-by- 
3 modules (nX3) 511, 512, etc., preferably has a struc- 
ture corresponding to module 100' of FIG. 1 and more 
preferably according to the compact topology 200 of 
FIG. 2. One or more copies of the n-by-m/(m+ 1) mul- 
tiplier system 500 may be placed either integrally on a 
monolithic integrated circuit (IC) chip which has bus 
509 extending on the chip or on a multi-chip carrier 
module (MCM) or multi-chip printed circuit board 
which has bus 509 extending on the MCM substrate or 
on the board. 

The topology of system 500 can even be conveniently 
programmed into a field-programmable gate array de- 
vice (FPGA) such tiiat of the Xilinx XC4000 TM family 
(available from Xilinx Inc. of San Jose, Calif,) because 
the control signals can be subdivided into small parallel 25 
clusters of 3 or 4 bits each and the n bits-wide multipli- 
cand A signal can also be parsed into clusters of bits that 
are operated on by the sniall clusters of control signals. 

The n bits-wide, multiplicand A signal supply bus 509 
connects directly to the multiplicand input of the first 30 
n-by-3 module 511. Consequentiy, the first partial result 
signal Ri of module 511 has an effective width of n+3 
bits and represents multiples of A in minimal gradua- 
tions of eighths. (Ri = A-P/S for P=0 to 7 or P = 1 to 8.) 

A *2-4prescaler 504 is interposed between the multi- 35 
plicand A signal supply bus 509 and the second n-by-3 
module 512. The second partial result signal R2' of 
module 512 accordingly has an effective width of n-t-7 
bits and represents multiples of A in "more finely- 
granulated" minimal graduations of 128ths. 40 
(R2'=A.P7128 for F=0 to 7 or F=l to 8.) 

In other words, the minimal graduations of the sec- 
ond module 512 are one-sixteenth (l/16th) the mmiTn^tl 
graduations of the first module 511. Note that variable 
P' has only eight quantization levels (P'-O to 7 or F = 1 
to 8). Note further, however, that the graduations pro- 
duced by the second module 512 will be selectively 
added to, or subtracted from, the graduations produced 
by the first module 511 in a not-yet introduced, adder/- 
subtractor unit 531. 

The capability to both add and subtract the finer 
graduations of Uie second module 512 gives the eight, 
so-called P' quantization levels an effective range of 16 
points rather than just 8. This is why prescaler 504 is set 
to provide prescaling by one-sixteenth rather than by 55 
one-eighth. And because of this, the combination of the 
second n X 3 module 512 with the first n X 3 module 511 
synergjstically defines part of an n X 7 multiplier rather 
than merely an nx 6 multiplier. (In terms of synergistic 
math, it appears that 3n+3n becomes 7n instead of 6n. 
Actually, there is no magic here. The 7th "n" comes 
from the sign selection bit SSI applied to the subsequent 
adder/subtractor unit 531.) The nX 7 multiplier is com- 
pleted by adding the not-yet but soon-to-be introduced, 
adder/subtractor unit 531 and mapping control unit 65 
550. 

A *2- 8 prescaler 508 is interposed between the multi- 
plicand A signal supply bus 509 and the third n-by-3 
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module 513. The third partial result signal R3 of module 
513 therefore has an effective width of n-fll bits and 
represents multiples of A in "even more finely- 
granulated" minimal graduations of A/2048. 
(R3=A.F72048 for P=0 to 7 or P=l to 8.) 

The granularity of partial result values generated by 
each next successive module (e.g., 514, not shown) 
continues to reduce in this manner. Each next succeed- 
ing nX3 module produces a partial result signal repre- 
senting multiples of A in minimal graduations that are 
one-sixteenth (=2-'*) that of the previous, more-signifi- 
cant module so as to fill in value gaps between the 
coarser gradxiations of the previous module. 

Accordingly, the i-th n-by-3 module (shown as mod- 
ule 510 of the series 511, 512, etc. has a *2-'*' prescaler 
50 X interposed between it and the multiplicand A 
signal supply bus 509, where X=4{i — 1). The i-th par- 
tial result signal R/ of module 51/ has an effective width 
of n+X-f3 bits (=n-l-4(i-l)+3 bits) and represents 
20 multiples of A in "even more finely-granulated" mini- 
mal graduations of A*2-(4('-i)-J-3). (R/=A.P'*2-(- 
4(i-i)+3) for p/=0 to 7 or P'= 1 to 8.) 

The last module 51/ in the series, 511-51/, is prefera- 
bly an n-by-4 module (nX4) structured similarly to 
module 300' of FIG. 3. This j-th module has a *2-Y 
prescaler 50 Y interposed between it and the multipli- 
cand A signal supply bus 509, where Y=4(j— 1). The 
j-th partial result signal R/of module 51/ has an effective 
width of n+Y+4 bits (=n+40-l)H-3-f 1 bits) and 
represents multiples of A in minimal graduations of 
A*2-(4(/-l)+4). (Ry=A.P/*2-W-l)+4) for py=0 to 15 
or PJ— 1 to 16.) The n-by-4 module 51/ produces a par- 
tial result signal Kj representing multiples of A in mini- 
mal graduations that are one-thirtysecond (l/32nd) that 
of the previous module. 

A series of adder/subtractor units 531, 532, . . . , S3k 
are further provided in the n-by-m/(m-l- 1) multiplier or 
"graduator" system 500 as shown, each having a corre- 
sponding sign-select terminal, SSI through SSk. The 
Ri output of first module 511 defines a first-level ap- 
proximation of the desired target output, C. Adder/sub- 
tractor unit 531 receives the coarse RI and finer-adjust- 
ing R2' partial result signals, and produces a second- 
level, partial result signal R12 representing the sum or 
difference of the RI and R 2' partial result signals. Ad- 
der/subtractor unit 532 receives the R12 and R3 partial 
result signals, and produces a third-level partial result 
signal R23 representing the sum or difference of the R12 
and R3 partial result signals. The granularity of R3 is 
finer than that of R2 so that finer adjustments can be 
made to the second-level, partial result signal R12 in 
order to get closer, when necessary to the desired target 
output, C. This sequence continues through to the last 
adder/subtractor unit 5dk which receives the partial 
result signal Rhi from the preceding adder/subtractor 
unit SZh (not shown) and also the Rj partial result signal 
from the nx4 module 51/ The last adder/subtractor 
unit SSk outputs result signal C. The result signal C has 
&n effective bit width of n+m-}-(l or 0) =n+Y+3-J-(l 
or 0), the last bit being added if pseudo multiplication is 
to be tolerated or not added if true multiplication is to 
be required. 

There are k=:(j-l) adder/subtractor units 531-53^ 
in the n-by-m/(mH-l) multiplier system 500 of FIG. 5. 
When these adder/subtractor units 531-53Jt are imple- 
mented in accordance with a carry ripple design, signal 
propagation delay through system 500 can be defined as 
n-f (j— 2) ripple-carry delays, where a ripple carry 
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delay is the time needed to generate the least significant vided by such an n X 4 graduator module is usable in the 

output bit of one such ripple-carry adder unit. application environment. 

A mapping control unit 550 is further provided in the It is to be further noted that the shift operations of 

n-by-m/(m+ 1) multiplier or "graduator" system 500 as prescalers 504, 508, . . , , 50/, . . . , 50y are preferably 
shown, for receiving an m-bits wide (or alternatively an 5 integrated into the internal shift operations of their 

m-i- 1 bits-wide), multiplier defining signal (B) at an respective graduator modules 512, 513, . . . , 51j, . . . , 51/ 

mput port 550x and mapping that signal (B) into corre- rather than being performed separately. The prescaiing 

spending values of P,P',P'', etc and converting these P, factor does not of necessity have to geometrically 

P', P", etc. values into appropriate control signals 555 progress by amounts of ♦2-'*. This is merely considered 

that are applied to modules 511 through 51/ and into the ^® optimum amount for obtaining maximum perfor- 

sign-select signals, SSI through SSk that are applied to mance from each graduator module added to the sys- 
adder/subtractor units 531 through SSk. Mapping con- 

trol unit 550 can be configured so that result signal C ^^^^^ graduator module does not have to be of 
represents or substantially approximates the value A-P- an n X 3 type. In some circumstances it may be advisable 
/2«for all integer values of P in the range 0 to 2«- 1 (or performance by adding nX2 graduator mod- 
altemativdy, 1^P^2«), where u is an integer defining ^ graduator module has substantially the 
a desired upper level of precision (u is equal to m or ^ ^ * the nX3 graduator 
m-h 1) and P is a mapped function of the input signal B "^^^^^ ^^^P^ P^escaler 103 is deleted, mul- 
that is presented to control port 550;c of mapping unit tiPfexer input port 1206 is deleted and mapping control 
550 ^'^ ° 20 umt 150 is simplified to ehmmate the selection multi- 
It is to be noted that the last graduator module in the P^^^^' ^P"* port 1206. In such a case, the minimum 
series 511-5V does not have to be an nX4 module. It fjf'^^^'Z f ^''^ ^ ^ ^^"^ 
can instead be an nx3 module (or even a graduator of instead ot A/8, 
fewer last-stage graduations such as an nx2 module „ «,,®/^'Hrim^^^ 

described later, bdow). The bit width, m, of the multi- ^Zr Z ^ ^ I T ^^'IT)"^ ^""^"^ 

plier B input port 55(bc will vary in general, in accor- 'Z 'n^ ^ ^^T^ ^'^.''^ P^^^^, ^ 

dance with thrnumber of nx3^Jltor m^iules that "^^i^ ^ i^"^ coirespondmg (nrcmtry m multip ex- 

are used and the nmnber of A.P^^alues that are to be ^""i^.^oA^ol ^nit' iSTZtl'T^^ "^"^"^ 

11 T ^""^ ' ^"^^P^^^.S ''^f.'^l:'.^ ^'"t"^/^' tor module ^^ be referred to here asan nX 1.5 ^adua- 
Addmg the second gr^uator mod^^ 51 (and the first .^r module. It can produce a result signal C represent- 
^der/subtractor umt 531) extends die aUowed resolu- ing A*[(0ora5)±(0or0.25)], Insome^cumsLcesit 
?7°%? .^^? rj^ ^ ^l l l'^^. '° ^ ^f^'^t may be usef\il to use such a nx 1.5 graduator module as 
?Z\t ^ ^' ^"^^f ^ the thnrd graduator module 513 35 a last or otiier stage in a graduator system. In yet other 
(and the second adder/subtractor umt 532) extends the circumstances it may be useful to use an even more 
allowed resolution of the multiplier B signal by anotiier primitive, nX 1 graduator module as a last or other stage 
4 bits so that It can be U bits-wide in a graduator system. An nXl graduator module is 
(ll=3+3+3+H-l). Each further added n X 3 gradua- simply tiie prescale-shifted multiplicand A bus with no 
tor module extends the allowed resolution of the multi- 40 processing logic attached. 

plier B signal by another 4 bits. If the last added gradua- Referring back to FIG. 5, the illustrated circuit con- 
tor module is an nX4 type mstead of an nX3 type, the figuration is disadvantageous from the point of view of 
corresponding extension of B is by 5 bits instead of by 4 scalability because signal propagation delay increases at 
^^ts. least linearly as more and more graduator modules are 

The below Table-5 shows this progression of im- 45 added, 
proved resolution in the B signal as one adds more nX 3 FIG. 6 shows another, more preferred topology 600. 
graduator modules. Like reference symbols and numbeis in the "600" series 

TABLE-5 ^ elements of FIG. 6 which correspond to but 

are not necessarily the same as the elements represented 
50 by similar symbols and reference numbers of the "500" 
series in FIG. 5. As such, a detailed description of the 
elements foimd in FIG. 6 is omitted here. 

Rather than cascading aU the adder/subtractor units 
in series as was done in FIG. 5, the adder/subtractor 
55 units 631-(aA, 6^1-6^2 and 660 of FIG, 6 are arranged 
in a parallel/serial fashion. The resultant concurrency 
of merge operations, where fine adjustment values are 
added to or subtracted from coarser approximation 
The general formula for the optimal number of usable B values, helps to reduce the worst-case signal propaga- 
bits is: 60 tion time of the illustrated topology 600. 

In a first sum/difference forming stage 630, adder/- 
Pneision in number of bits=3j-^k+(0 or l); subtracter unit 632 begins to form a sum or difference of 

partial result signals, R3 and R4, at the same time that 
wherem J is the total number of n X 3 or better graduator adder/subtractor unit 631 begins to form a sum or dif- 
modules, k is the number of adder/subtractor units and 65 ference of partial result signals. Ri and R2. Adder/sub- 
the last term is a one or zero depending on whether or tractor unit 63/i simultaneously begins to form a sum or 
not the last graduator module 51; is a nX4 type and difference of partial result signals, R,*and R;. The num- 
whether or not the last bit of pseudo resolution pro- ber, j, of graduator modules 611-61/ is preferably even 
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SO that respective outputs from each successive pair of amplification by inverters 214 and 215 is optional and 

graduator modules merge into a corresponding stage may be eliminated. Shifting and adding/subtracting can 

630 adder/subtractor unit 631, 632, . . . , 63A, where be each alternatively carried out by serial units rather 

h = j/2. than the illustrated parallel units. Data storage registers 

In a second sum/difference forming stage 640, that 5 may be added where appropriate to hold temporary 
succeeds first sum/difference forming stage 630, pairs of results of serial, over-time operations, 
sums produced by the first sum/difference forming With regard to FIGS. 5 and 6, the higher level adjust- 
stage 630 are added to or subtracted from second stage ment stages (531, 532, 53k, 630, 640, 660) do not have to 
results such as R/;=R/±Ry. And in a third sum/differ- have selectable add and subtract functions. Instead, the 
ence forming stage 660, that succeeds second sum/dif- 10 finer adjustment values (e.g., R2, R4, Rj) can be signed 
ference forming stage 640, pairs of sums produced by values. Recall that each of the finer granularity modules 
the second sum/diflference forming stage 640 are added (512-515, 612-615) can be made to have a transfer func- 
to or subtracted from third stage results such as tion of the form C=A'(B+F)/Q, where B is a supplied 
R^Ay— RgA±Ri7. The stages continue for as many times index signal, Q is a power of two that defines the granu- 
as necessary to produce a result signal C of a desired 15 larity, and F can be a positive or negative offset con- 
precision. The advantage of thb architecture 600 is that stant. If F is an appropriate negative value, C can swing 
it reduces the overall signal propagation delay time. to both plus and minus even though B is always posi- 

In light of the above, those skilled in the art will tive. Thus the finer adjustment signals can have the 

recognize that many other topologies are possible for sign-select built into them rather than occurring in a 

merging: fine adjustment terms (the less significant 20 follow up operation. 

terms) with coarser approximation terms (the more With regard to the polarity of the supplied multipli- 

significant terms), A topology having 2^ graduators, cand A and multiplier B signals, there are four varia- 

where x= 1,2,3, is preferred in order to support a tions: both positive, both negative, A is positive while B 

bmary-tree merging topology of the type suggested by is negative, and A is negative while B is positive. Much 

FIG- ^- 25 of the above discussion has assumed that both A and B 

The above disclosure is to be taken as illustrative of are positive binary-coded digital signals. It is well 
the invention, not as limiting its scope or spirit Numer- within the purview of those skilled in the art to make 
ous modifications and variations wUl become apparent appropriate modifications for handling signed multipli- 
to those skilled in the art after studying the above dis- cations. Either the input signals (A and B) can be con- 
closure. 30 verted to be both positive before application to a multi- 

By way of example, in cases where small circuit size plier circuit in accordance with the invention and the 
is more important than speed; some or most of the result signal C can be afterwards negated as necessary; 
above described operations can be performed in serial- or equivalent shift and control operations can be per- 
over-time fashion rather than in the disclosed parallel- formed within the graduator modules to provide a bi- 
over-space fashion. The underlying process is to form 35 nary-coded result signal C of appropriate polarity, 
first-level, coarse approximation and fine adjustment It is worthwhile to summarize here some of the par- 
signals through physical shifting of a given multiplicand ticularly advantageous features of the invention that 
signal, A. A fine adjustment signal is added to or sub- may have escaped notice in the detailed discussion, 
tracted fi-om a coarse approximation signal in order to First, there is no limit on the size of the multiplicand A 
generate a result signal representing a target value (C) 40 signal. It can be any arbitrary number, n, of bits-wide 
or more closely approximating the target value (C) than (e.g., 16, 32, 64, 128, and so forth). The size of the multi- 
did the initial coarse approximation signal. The process plier B signal is limited primarily by the fact that within 
can be compounded to construct approximations and each sum/difFerence-of-2 terms module, only integers in 
adj;istments of ever finer granularity. the range — 10 to -f- 10 can be precisely formed as a sum 

With regard to FIGS, 1 and 3, it is to be understood 45 or difference of just two term signals (T,and T/), where 

that the number of input ports per multiplexer and the each term signal represents either zero or A multiplied 

shift operations performed at each input port are not by a power of two. For general purposes, the optimal 

hmited to what is shown. The iUustrated shift amounts size for the B signal is believed to be 3 bits and hence, 

can be scaled by any desired power of two. The scale of the n X 3 module 100' of FIG. 1 forms the basic building 

zero through A times 1.00 was used merely for purposes 50 block for creating combination systems that can handle 

of normalization. The number of lines in each of the multiplier B signals of more than 3 bits, 

illustrated buses can be increased or decreased as appro- Second, the n X 3 module 100' of FIG. 1 provides an 

priate. In some applications, the less significant bits of a area efficient structure when unplemented according to 

result signal (R or C) can be truncated away when not the topology 200 of FIG. 2. This topology meshes 

needed and rounding may be provided where desirable 55 smoothly into the multi-module topologies of FIGS. 5 

in combination with such truncation. and 6 because the same multiplicand A signal is distrib- 

With regard to FIG. 2, it is to be understood that uted in parallel by one bus (509, 609) to all the graduator 

there are alternative circuits for implementing the bar- modules. 

rel shifting and summing functions of FIGS. 1 and 3. Third, there is a basic control scheme within the nx3 
The multiplexers can be of a classical AND/OR gate 60 building block module 100' of FIG. 1 that scales nicely 
construct instead of the illustrated pass-transistor con- into the control scheme used by aggregate systems that 
struct. Technologies other than CMOS (complemen- combine many such building blocks. Put simply, coarse 
tary metal-oxide-semiconductor) can be used, such as approximation signals are augmented both construc- 
for example, bipolar transistor logic. Sign selection for tively and destructively by fine adjustment signals to 
the adder/subtractor unit can be implemented with a 65 provide a fuU range of values having a desired granular- 
bank of exclusivc-OR gates (XOR's) following nonin- ity. In the multi-module systems such as shown by 
verting outputs of the multiplexers that determine shift- FIGS. 4A, 5 and 6, this basic control scheme is repeated 
amoimt for each of the term signals (T,-, T/). Signal by providing both additive and subtractive fine adjust- 
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ments of different granularities to coarser approxima- 
tion signals. This has the benefit of doubling the adjust- 
ment range provided by the modules that generate the 
fine adjustment signals. 

The repetition of the basic control scheme within and 5 
outside the nX3 building block modules 100' produces 
a number of periodicities in the patterns of control sig- 
nals (e.g., I,J,K,L>1,N,S) that need to be developed by 
the mapping control units (150, 350, 450, 550 or 650). 
These periodicities permit significant reduction in the 10 
circuit size of the mapping control units (150, 350, 450, 
550 or 650) by means of Kamough or other well-known 
circuit minimization techniques. 

Fourth, output production time is reduced by the fact 
that the multiplicand A signal undergoes sigiiificance 15 
translation (A is barrel shifted) at the same time that 
decoding of the multiplier B signal begins. The concur- 
rence of these operations works to reduce the overall 
signal propagation delay time of the system. 

Fifth, the fundamental operation for providing multi- 20 
plication by a wide variety of multipliers B (where B is 
not necessarily a power of two) is based in the aggre- 
gate on the idea of simultaneously multiplying a same 
multiplicand A signal by two or more powers of two. 
Then sums or differences of the results are generated. 25 
The simultaneous multiplications of the multiphcand A 
signal by various powers of two can be performed 
quickly and with minimal circuitry. This further works 
to reduce the overall signal propagation delay time of 
the system. 30 

Sixth, because the merging of finer adjustments with 
coarser approximation signals happens in successive 
stages (e.g., 630, 640, 660 of FIG. 6), the sign select (S 
or SSi) control signals that control downstream merges 
do not have to be generated as quickly as those 35 
(I,J,L,M) that control upstream selection of the shift 
amounts and those (K,N) that control the first round of 
additions and subtractions. Designers can optimize the 
circuitry of the mapping control unit (e.g., 650) to take 
advantage of this phenomenon by having it generate the 40 
most upstream control signals (I J,L,M) in the least 
amount of practical time, then i^e next needed, mid- 
stream control signals (K^N), and lastly the downstream 
control signals (S or SSi). 

As a seventh and final point, it should be noted that 45 
the concept of an imperfect yet still useful multiplier 
module (FIG. 3) has been mtroduced. A significant 
improvement in the signal to noise ratio of many DSP 
applications can be obtained (approximately a 5 db gain) 
by using the nx 4 design of FIG. 3 as the only or last in 50 
a series of graduator modules. An inherent and statisti- 
cally-based error averaging effect can be found in many 
DSP applications. This will depend on the statistical 
distribution of the various multiplier B codes supplied in 
a given DSP system. Appropriate selection of the polar- 55 
ity of errors at points corresponding to P/Q= 11/16 and 
P/Q= 13/16 can combine with the statistically-based 
error averaging effect to greatiy reduce what at first 
appears to be an intolerable amount of error (or 
"noise")- In applications where the multiplier B signals 60 
tend to have a zero average over time and distribute 
unifonnly in the positive and negative directions, it is 
generally advisable to provide a symmetrical distribu- 
tion of counterposed positive and negative errors as one 
moves across the spectrum of possible values for B. 65 
They will tend to cancel one another out over time. In 
some unidirectional systems such as servo positioning 
systems, it is not desirable to have both positive and 



negative pointing errors since this only works to double 
the positional imprecision of the system. In such a case, 
it will be advisable to instead direct all the errors at 
points corresponding to P/Q= 11/16 and P/Q= 13/16 
to be either all positive or all negative. Variations to 
these general suggestions will of course develop due to 
the peculiarities of specific applications. 

Given the above disclosure of general concepts and 
specific embodiments, the scope of protection sought is 
to be defined by the claims appended hereto. 

What is claimed is; 

1. A multiples generating apparatus for generating 
physical signals representing rational multiples of a 
value A represented by a received, digital multiplicand 
signal (A), the apparatus comprising: 

(a) plural shifting means, each receiving said multipli- 
cand signal (A) and each receiving one of plural 
shift control signals, each of the shifting means 
being for shifting the received multiplicand signal 
(A) and producing therefrom, in response to the 
corresponding shift control signal, a term signal 
either representative of the multiplicand (A) multi- 
plied by two raised to an integer power or repre- 
sentative of a nullity; 

(b) summing means, operatively coupled for receiv- 
ing term signals from two or more of said plural 
shifting means and for producing a physical result 
signal representing a sum or difference of the re- 
ceived term signals; and 

(c) mapping control means, operatively coupled to 
the plural shifting means, for receiving a supplied 
index signal (B) and responsively outputting the 
shift control signals to each of the plural shifting 
means such that the result signal produced by the 
summing means represents a midtiple of the re- 
ceived multiplicand signal (A) having the form, 
A.P/Q, where Q is an integer equal to two raised to 
an integer power (Q=2/) and P is an integer se- 
lected from a series of integers in accordance with 
a predefined mapping of B to P. 

2. A multiples generating apparatus according to 
claim 1 

wherein the simaming means includes sign control 
means responsive to the mapping control means for 
determining whether each one or more of the term 
signals is added or subtracted to produce the result 
signal; and 

wherein the mapping control means outputs a sign 
select signal to the sign control means, in response 
to the supplied index signal (B), for determining 
whether each one or more of tiie termsignals is 
added or subtracted to produce the result signal. 

3. A multiples generating apparatus according to 
claim 1 

wherem P is an integer selected from a sequential 
series of integers in accordance with a predefined 
linear mapping of B to P such that the result signal 
is of the form, R=A.(B-|-F)/Q, where B is the 
supplied index signal and F is an offset constant. 

4. A multiples generating apparatus according to 
claim 1 

wherein P is an integer selected froin the sequential 
series of integers: 0, 1, 2, . . . , Q, Q-hl, Q-l-2, . . . 
, in accordance with a predefined linear mapping of 
B to P such that the result signal is of the form, 
R=A-(B+F)/Q, where B is the suppUed index 
signal and F is an offset constant 
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5. A multiples generating apparatus according to 
claim 4 

wherein Q is equal to 8 and P is an integer selected 
from eight consecutive integers in the series of 
integers: 0, 1, 2, . . . , Q. 5 

6. A multiples generating apparatus according to 
claim 1 

wherein P is an integer selected from the consecutive 
series of integers: 0, 1, 2, . . , , Q, Q+ 1, Q+2, . . . 
, in accordance with a prede&ied mapping of B to 10 
P such that, for a majority of values of P, the result 
signal is of the form, R=A-(B-|-F)/Q, where B is 
the supplied index signal and F is an offset constant. 

7. A multiples generating apparatus according to 
claim 6 IS 

wherein Q is equal to 16 and p is an integer selected 
from fourteen integers in the consecutive series of 
integers: 0, 1, 2, . . . , Q. 

8. A multiples generating apparatus according to 
claim 1 wherein the generated result signal represents in 20 
real-time a physical quantity selected from the group 
consisting of: (d.l) real-time tomography measurement 
signals; (d.2) real-time medical diagnosis or treatment 
signals; (d.3) real-time audio or ultrasonic sound signals; 
(d.4) real-time video signals; (d.5) real-time telecommu- 25 
nication signals; and (d.6) real-time adaptive-filtering 
coefficients. 

9. A multiples generating apparatus according to 
claim 1 wherein the mapping control means includes a 
combinatorial logic circuit 30 

10. A multiples generating apparatus according to 
claim 1 wherein the mapping control means includes 
one of a ROM, PROM, EEPROM or other random 
access memory device. 

11. A multiples generating apparatus for generating 35 
signals representing multiples of a value A represented 
by a received multiplicand signal (A), the apparatus 
comprising: 

(a) a plurality of partial result generating modules 
where each module mcludes: 40 
(a. 1) plural shifting means, each receiving said mul- 
tiplicand signal (A) and each receiving one of 
plural shift control signals, each of the shifting 
means being for shifting the received multipli- 
cand signal (A) and producing therefrom, in 45 
response to the corresponding shift control sig- 
nal, a term signal either representative of the 
multiplicand (A) multiplied by a power of two or 
representative of a nuUit>^ and 

(a.2) first-level summing means, operatively cou- 50 
pled for receiving term signals from two or more 
of said plural shifting means and for producing a 
partial result signal representing a sum or differ- 
ence of the received term signals; 

(b) second-level summing means, operatively coupled 55 
for receiving partial result signals from two or 
more of said plural modules and for producing a 
second-level result signal representing a sum or 
difference of the received partial result signals; and 

(c) mapping control means, operatively coupled to 60 
the plural pl\irality of partial result generating 
modules and to the second-level summing means, 
for receiving a supplied index signal (B), for re- 
sponsively outputting the shift control signals to 
each of the plural shifting means in each module, 65 
and for responsively outputting sign select signals 

to each of the firstlevel and second-level summing 
means, such that the second-level result signal pro- 
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duced by the second-level summing means repre- 
sents a multiple of the received multiplicand signal 
(A) having the form, A-P/Q, where Q is an integer 
equal to two raised to an integer power (Q==2/) and 
P is an integer selected from a series of integers in 
accordance with a predefined mapping of B to P. 

12. A multiples generating circuit for generating elec- 
trical signals representing rational multiples (A-P/Q) of 
a value A represented by a received, binary-coded mul- 
tiplicand signal (A), the circuit comprising: 

(a) a plurality of or-constant shifters, each receiving 
said multiplicand signal (A) and each receiving one 
of plural, or-constant control signals, each of the 
or-constant shifters including circuitry, responsive 
to a corresponding or-constant control signal for 
either shifting the received multiplicand signal (A) 
and producing therefrom a term signal representa- 
tive of the multiplicand (A) multiplied by two 
raised to an integer power or producing a term 
signal representative of a predefined constant; 

(b) a summing circuit, operatively coupled to the 
or-constant shifters for receiving term signals from 
two or more of said plurality of or<onstant shifters 
and for producing a result signal representing a 
sum or difference of the received term signals; and 

(c) mapping control means, operatively coupled to 
the plurality of or-constant shifters, for receiving a 
supplied index signal (B) and responsively output- 
ting the or-constant control signals to each of the 
plurality of or-constant shifters such that the result 
signal produced by the summing means represents 
a multiple of the received multiplicand signal (A) 
having the form, A-P/Q, where Q is an integer 
equal to two raised to an integer power (Q=2/) and 
P is an integer selected from a series of integers in 
accordance with a predefined mapping of B to P. 

13. A multiples generating circuit according to claim 

12 wherein each or-constant shifter comprises: 

(a.1) a multiplexer having a plurality of input ports, 
an output port for outputting a signal presented to 
a selected one of the multiplexer input ports, and a 
selection control port for selecting one of said mul- 
tiplexer input ports; 

(a.2) a plurality of digital shifters, each coupled to 
receive the multiplicand signal (A) and to output a 
corresponding and differently shifted version of 
the multiplicand signal (A*2-') to a corresponding 
one of said multiplexer input ports; and 

(a.3) constant-defining means for applying a constant- 
representing signal to a corresponding one of said 
multiplexer input ports. 

14. A multiples generating circuit according to claim 

13 further comprising: 

(d) a multiplicand supply bus coupled to each of the 
digital shifters for simultaneously supplying the 
multiplicand signal (A) to each of the digital shift- 
ers. 

15. A multiples generating circuit according to claim 

14 wherein each of the plural digital shifters is a fixedly- 
wired parallel shifter. 

16. A multiples generating circuit according to claim 
12 wherein the summing circuit includes a two-port 
adder/subtractor unit having respective first and sec- 
ond input ports for receiving respective first and second 
term signals from two of said plurality of or-constant 
shifters and having an output port for producing said 
result signal, the two-port adder/subtractor unit further 
having a sign-select control port for receiving a sign- 
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control signal that defines whether the result signal 
represents a sum or difference of the first and second 
term signals. 

17. A multiples generating circuit according to claim 
16 wherein the mapping control means includes a com- 5 
binatorial logic circuit for decoding the supplied index 
signal (B) and responsively outputting the or-constant 
control signals and the sign-control signal. 

18. A multiples generating apparatus for generating a 
succession of physical signals representing a succession 10 
of rational multiples of a value A, where the value A is 
represented by a received, binary-coded multiplicand 
signal (A), said apparatus comprising: 

(a) a plurality of or-constant shifting means, each 
or-constant shifting means receiving said multipli- IS 
cand signal (A) and each receiving one of plural 
shift or-constant control signals, 

(a.1) each of the or-constant shifting means includ- 
ing shift means for shifting the received multipli- 
cand signal (A), and 20 

(a.2) each of the or-constant shifting means further 
including term-producing means, operatively 
coupled to the corresponding shift means, for 
producing, in response to a corresponding shift 
or-constant control signal, a term signal either 25 
representative of the multiplicand (A) multiplied 
by two raised to an integer power or representa- 
tive of a predefined constant; 

(b) summing means, operatively coupled for receiv- 
ing term signals from two or more of said plural 30 
or-constant shifting means and for producing a 
physical result signal representing a sum or differ- 
ence of the received term signals; and 

(c) mapping control means, operatively coupled to 
the plural or-constant shifting means, for output- 35 
ting a succession of the shift or-constant control 
signals to the or-constant shifting means such that 
the result signal produced by the summing means 
represents a succession of rational multiples of the 
value A of the received multiplicand signal (A), 40 
each of said multiples having the form, A-P/Q, 
where Q is an integer equal to two raised to an 
integer power (Q=20 and P is a corresponding one 
of a succession of integers selected from a prede- 
fined series of integers. 45 

19. A multiples generatmg apparatus according to 
claim 18 wherein said summing means includes a carry 
look-ahead adder. 

20. A multiples generating apparatus according to 
claim 18 wherein said summing means includes a carry- 50 
ripple adder. 

21. A multiples generating circuit for generating elec- 
trical signals representing rational multiples of a value 
A, where the value A is defined by a supplied, binary- 
coded multipUcand signal (A), the circuit comprising: 55 

(a) a plurality of or-constant shifters, each receiving 
said multiplicand signal (A) and each receiving one 
of plural, or-constant control signals, each of the 
or-constant shifters including circuitry, responsive 

to a corresponding or-constant control signal for 60 
either shifting the received multiplicand signal (A) 
and producing therefrom a term signal representa- 
tive of the multipUcand (A) multiplied by two 
raised to an integer power or producing a term 
signal representative of a predefined constant; 65 

(b) a simiming circuit, operatively coupled to the 
or-constant shifters for receiving term signals from 
two or more of said plurality of or-constant shifters 
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and for producing a result signal representing a 
sum or difference of the received term signals; and 
(c) an inferential mapping means, operatively coupled 
to the plurality of or-constant shifters, for output- 
ting the or-constant control signals to each of the 
plurality of or-constant shifters such that the result 
signal produced by the summing means represents 
a multiple of the received multiplicand signal (A) 
having the form, A-P/Q, where Q is an integer 
equal to two raised to an integer power (Q =2/) and 
P is an integer selected from a series of integers in 
accordance with a predefined mapping of an infer- 
entially defined variable, B, to P. 

22. A multiples generating circuit according to claim 
21 wherein the inferential mapping means includes a 
cyclical storage means for storing and cyclically output- 
ting a predefined sequence for each of the or-constant 
control signals. 

23. A multiples generating apparatus for generating 
physical signals representing a succession of rational 
multiples of a value A, where the value A is represented 
by a received, binary-coded multiplicand signal (A), 
said apparatus comprising: 

(a) a plurality of or-constant shifting means, each 
or-constant shifting means receiving said multipli- 
cand signal (A) and each receiving one of plural 
shift or-constant control signals, each of the or- 
constant shifting means being for shifting the re- 
ceived multiplicand signal (A) and producing 
therefrom, in response to a corresponding shift 
or-constant control signal, a term signal either rep- 
resentative of the multiplicand (A) multiplied by 
two raised to an integer power or representative of 
a predefined constant; 

(b) summing means, operatively coupled for receiv- 
ing term signals from two or more of said plural 
or-constant shifting means and for producing a 
physical result signal representing a sum or differ- 
ence of the received term signals; and 

(c) inferential mapping means, operatively coupled to 
the plural or-constant shifting means, for output- 
ting a succession of shift or-constant control sig^ials 
to each of the plurality of or-constant shifting 
means such that the result signal produced by the 
summing means represents a succession of multi- 
ples of the received mxiltiplicand signal (A), each 
multiple having the form, A-P/Q, where Q is an 
integer equal to two raised to an integer power 
(Q=20 and p is a corresponding integer selected 
from a succession of integers within a predefined 
range of integers, in accordance with a predefined 
mapping to P of an inferentially-defined, time- 
varying signal, B. 

24. A multiples generating apparatus according to 
claim 23 wherein said time-varying signal, B is periodic 
and the inferential mapping means includes a circulating 
shift register for storing and cyclically outputting a 
predefined sequence for each of the shift or-constant 
control signals. 

25. An integrated circuit comprising: 

(a) a substrate; 

(b) a first bus having at least. n+ 1 lines extending 
along said substrate where n is greater than three, 
at least one of the n-{- 1 lines being a constant-main- 
taining line for maintaining a constant logic level, 
and at least n others of the lines being multiplicand- 
carrying lines for carrying in parallel, n bits of a 
supplied, n-bit wide digital multiplicand signal 
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(Ao:(fl.i)) that represents a succession of multipli- 30. An integrated circuit according to claim 25 

cand values over time; wherein the switching devices include transistors de- 

(c) a second bus having at least six control lines ex- fined in the substrate and wherein the adder/subtractor 
tending along said substrate substantially parallel to unit includes, for each multiplexer cell, a series-con- 
the lines of the first bus, the at least six conuol Imes 5 nected pair of inverters also defmed in the substrate and 
being for carrying in parallel a corresponding set of coupled to the switching device transistors of the corre- 
at least six control signals (laxyJaic); spending multiplexer cell. 

(d) first and second pluralities of multiplexer cells, 31. An integrated circuit according to claim 30 
each multiplexer cell including: wherein the adder/subtractor unit includes, for each 
(d.I) an output node, and lo multiplexer cell, a feedback switch coupling an output 
(d.2) at least three switching devices respectively of one of the corresponding series-connected pair of 

controlled by control signals on a corresponding inverters to an input of the other of the series-connected 

at least three control lines of the second bus, the pair of inverters to thereby define a data latch for stor- 

at least three switching devices havmg inputs ing the output of the corresponding cell output node, 

respectively coupled to two or more lines of the 15 32. A circuit for carrying out machine-implemented 

first bus, one of said two or more lines being a multiplication of respective values represented by a 

constant- m ai ntainin g line and another of said supplied, n-bits wide, multiplicand signal (A) and by a 

two or more lines being a multiplicand-carrying physically real or inferentiaily defined multiplier signal 

line, the at least three switching devices further (B), said circuit comprising: 

having outputs conmaonly coupled to the cell 20 (a) a plurality of barrel shifting units for providing bit 

output node; significance translation of the multiplicand signal 

(e) an adder/subtractor unit having a first term re- (A); 

cciving port operatively coupled to the output (b) a summing unit operatively coupled to the barrel 

nodes of the first plurality of multiplexer cells, a shifting units for adding or subtracting output sig- 

second term receiving port operatively coupled to 25 nals of the barrel shifting units to each other or to 

the output nodes of the second plurality of multi- a supplied constant in accordance with a supplied 

plexer cells, and a sign-defining input, the adder/- control signal; and 

subtracter unit outputting a result signal represent- (c) a mapping control unit operatively coupled to the 
ing, for a first state of the sign-defining input, a sum barrel shifting units and the summing unit for con- 
of first and second term signals respectively output 30 trolling the barrel shifting units and the summing 
by said pluralities of multiplexer cells, and repre- unit such that the simmiing unit produces a result 
senting, for a second state of the sign-defining in- signal that is at least approximately representative 
put, a difference between the first and second term of the value of the multiplicand signal (A) multi- 
signals; and plied by the value of the multiplier signal (B). 

(f) a mapping control unit, operatively coupled to the 35 33. A multiplication circuit according to claim 32 
at least six control lines and to the sign-defining wherein the multiplicand signal (A) is applied in parallel 
input, for defining states of the corresponding at to the plural barrel shifting units. 

least six control signals (laxjhx) and the sign-defin- 34. A multiplication circuit according to claim 32 

ing input so that the result signal output by the wherein each barrel shifting unit responds to a shift 

adder/subtractor unit represents a succession over 40 control signal (I, J) supplied to that shifting unit from the 

time of rational multiples of the succession of multi- mapping control unit and outputs a corresponding term 

plicand values represented by the supplied, n-bit signal (T) representing either a zero or the multiplicand 

wide digital multiplicand signal (Ao:(„.i)). signal (A) shifted by a number of bits defined by the 

26, An integrated circuit according to claim 25, shift control signal ^,J). 

wherein: 45 35. a multiplication circuit according to claim 34 

first and second conductor layers are provided, one wherein resulting term signals (T/, Tj) of respective 

overlapping the other, on said substrate; and barrel shifting units are applied to the summing unit for 

the first and second buses are provided, one overlap- producing a result signal (C) representing sums or dif- 

ping the other, in the respective first and second ferences of the term signals and wherein the mappmg 

conductor layers. 50 control unit determines whether the summing unit per- 

27. An integrated circuit according to claim 26 forms an addition or a subtraction for each said term 
wherein the adder/subtractor unit includes a carry-rip- signal. 

pie adder. 36. A multiplication circuit according to claim 32 

28, An integrated circuit according to claim 26 wherein mapping logic is provided within the mapping 
wherein the switching devices are pass transistors de- 55 control unit for causing the result signal (C) to be a 
fined in the substrate under the first and second buses. binary-coded signal representing a product of the multi- 

29. An integrated circuit according to claim 25 plicand signal (A) and the multiplier signal (B). 
wherein the mapping control unit controls the states of 37. A fimction generating apparatus for generating an 
the control signals (tncJox) and the sign-defining input output signal (C) that at least approximately represents 
so that the result signal output by the adder/subtractor 60 a function output value C= f(A,B) taken out of a prede- 
unit represents a succession over time of rational multi- fined range of output magnitudes, where f(A,B) repre- 
ples each having the form, A-P/Q, where Q is an integer sents a predefined fimction of first and second input 
equal to two raised to an integer power (Q— 2/) and P is values, A and B, said apparatus comprising: 

an integer selected from a series of integers, each P/Q (a) first signal receiving means for receiving a first 

ratio bemg predefmed for a corresponding one or more 65 digital signal (A) representing the first input value, 

of the succession of multiplicand values represented by A; 

the supplied, n-bit wide digital multiplicand signal (b) first approximating-term generating means, opera- 

(Ao:(n-i)). tively coupled to the first signal receiving means 



09/30/2004, EAST version: 1.4.1 



and responsive to a supplied first control signal, for 
generating in response to the first digital signal (A) 
and the first control signal, a first term signal repre* 
senting either a predefined constant or one of a first 
finite set of values distributed across said prede- 5 
fined range of output niagnitudes, said first finite 
set of values being distributed such that the gap 
between any two adjacent ones of the distributed 
first finite set of values is no more than a first prede- 
fined maximuTTi gap amount; 10 

(c) second approzimating-termgenerating means, op- 
eratively coupled to the first signal receiving means 
and responsive to a supplied second control signal, 
for generating in response to the first digital signal 
(A) and the second control signal, a second term 15 
signal representing either a predefined constant or 
one of a second finite set of values distributed 
across a predefined second range of magnitudes, 
said second range bemg approximately at least as 
wide as said first predefined maximum gap amount; 20 

(d) term combining means, operatively coupled to the 
first and second approximating-tenn generating 
means, and responsive to a supplied third control 
signal, for combining the first and second' term 
signals and producing therefrom, in response to the 25 
third control signal, a result signal representative of 
one of a third finite set of values distributed across 
said predefined range of output magnitudes, the 
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third finite set of values being distributed such that 
the gap between any two adjacent ones of the dis- 
tributed third finite set of values is substantially less 
than the first predefined maximum gap amount and 
such that, for a majority of the values in the first 
finite set, each of the same values found in the third 
finite set is surrounded in a generally synmietric 
fashion by neighboring other values of the third 
finite set; and 
(e) mapping means, operatively coupled to the first 
approximating-term generating means, and to the 
second approximating-tenn generating means, and 
to the term combining means, for respectively sup- 
plying said first, second, and third control signals 
such that the result signal is at least approximately 
representative of the fimction output value 
C=f(A,B) for the case where the second input 
value B is either defined by second digital signal 
(B) applied to the mapping means, or inferred from 
the first, second, and third control signals supplied 
by the mapping means. 
38, A function generating apparatus according to 
claim 37 wherein the termcombining means includes 
means for either adding or subtracting the fu^t and 
second term signals in response to the third control 
signal. 
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